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METHODS FOR DELIVERING NUCLEIC ACID MOLECULES INTO CELLS AND 

ASSESSMENT THEREOF 

RELATED APPLICATIONS 

5 This application is related to: U.S. application Serial No. 09/8 1 5,979 entitled 

"METHODS FOR DELIVERING NUCLEIC ACID MOLECULES INTO CELLS AND 
ASSESSMENT THEREOF", filed on March 22, 2001 by De Jong et al. ; U.S. application 
Serial No. 09/815,981 entitled "METHODS FOR DELIVERING NUCLEIC ACID 
MOLECULES INTO CELLS AND ASSESSMENT THEREOF", filed on March 22, 2001 

10 by De Jong et al. ; U.S. application Serial No. 1 0/086,745 entitled "METHODS FOR 
DELIVERING NUCLEIC ACID MOLECULES INTO CELLS AND ASSESSMENT 
THEREOF", filed on February 28, 2002 by De Jong et al. ; PCT International Application No. 
PCT/US02/09262 entitled "METHODS FOR DELIVERING NUCLEIC ACID 
MOLECULES INTO CELLS AND ASSESSMENT THEREOF", filed on March 22, 2002 

15 by De Jong et al. ; and, U.S . application Serial No. 60/3 77,547 entitled "METHODS FOR 
DELIVERING NUCLEIC ACID MOLECULES INTO CELLS AND ASSESSMENT 
THEREOF", filed on May 1, 2002 by De Jong et al. The subject matter of each of these 
applications is incorporated by reference in its entirety. 

20 FIELD OF THE INVENTION 

The present invention relates to methods of delivering nucleic acid molecules into 
cells and methods for measuring nucleic acid delivery into cells and the expression of the 
nucleic acids therein. 

25 BACKGROUND OF THE INVENTION 

A number of methods for the delivery of nucleic acid molecules, particularly plasmid 
DNA and other small fragments of nucleic acid, into cells have been developed. Many of 
these methods are not ideal for delivery of larger nucleic acid molecules. Thus, there is a 
need for methods of delivering nucleic acid molecules of increasing size and complexity, 
30 such as artificial chromosomes, into cells. Methods are required for use with in vitro and in 
vivo procedures such as gene therapy and for production of transgenic animals and plants. 
Furthermore, there is a need for the ability to determine and assess the efficiency of delivery 
of DNA into cells. 
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SUMMARY OF THE INVENTION 

In various aspects, the invention provides methods for delivering nucleic acid 
molecules, particularly larger molecules, including artificial chromosomes, into cells. 
Methods for assessing delivery are also provided. Selected methods may be used to deliver 
nucleic acid molecules and/or proteins of a range of sizes, and may be particularly suitable 
for delivery of larger nucleic acid molecules, such as natural and artificial chromosomes with 
associated proteins and fragments thereof, into cells. Alternative methods may be designed 
for in vitro and ex vivo delivery of nucleic acid molecules for applications, including, but not 
limited to, delivery of nucleic acid molecules to cells for cell-based protein production, 
transgenic protein production and gene therapy. Methods of protein production in cells and 
in transgenic animals and plants, methods of introducing nucleic acid into cells to produce 
transgenic animals and plants, and methods for ex vivo and gene therapy are also provided. 

Methods provided herein are designed for delivering a large nucleic acid molecule 
with or without associated proteins into a cell, but may also be used to deliver smaller 
molecules. Selected methods provided herein are designed for improving transfer of nucleic 
acid into the cells by delivering the nucleic acid into cells that are in a pre-selected phase. 
Cells of a pre-selected phase can be obtain in a number of ways. For example, cells can be 
isolated at a specific cell cycle stage using mechanical means such as elutriation or by 
exposing the cell to a cell cycle arrest agent or an agent that facilitates cell cycle 
synchronization: The agent can be one or more compositions, conditions and/or physical 
treatments that facilitates cell arrest and/or results in a synchronized population of cells. 
Selected agents and combinations thereof can include, for example, those that result in the 
highest transfection efficiency, highest delivery efficiency and/or the greatest amount of 
intact nucleic acid molecules transferred into the cell nucleus with an acceptable degree of 
cell survival. Thus, for example, the nucleic acid can be transferred to the cell at a phase or 
point in a phase of a cell that results in improved transfection efficiency, improved delivery 
efficiency and/or an improved proportion of intact nucleic acid molecules within the 
transfected cells (such improvements being relative to cells at a diffent phase or point in a 
phase of the cell cycle). 

The selected methods may vary depending on the target cells (cells into which nucleic 
acid is delivered), the nucleic acid molecules, and methods used for obtaining cells in 
selected phases. Exemplary methods for delivery of large nucleic acid molecules into cells 
provided herein include obtaining a cell that is in a pre-selected phase of the cell cycle then 
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contacting the cell with a large nucleic acid molecule. The pre-selected phase of the cell may 
be determined by introducing a large nucleic acid into cells at different phases and 
determining and comparing the efficiency of delivery and/or of transfection of the nucleic 
acid into the cells at different phases and selecting a phase at which the efficiency of delivery 
5 and/or of transfection is increased relative to efficiency of delivery and/or transfection at 
other phases. 

The pre-selected phase may be obtained by exposing one or more cells to a cell cycle 
arrest agent, exposing the cell(s) to conditions that permit cell cycling, and then selecting a 
cell that is in the pre-selected phase. Arrest agents may include, but are not limited to anti- 
1 0 microtubule agents such as nocodazole and agents that effect DNA replication such as 

thymidine. The pre-selected phase can be any phase including GO, Gl, S, G2, M, or at the 
interface of two phases including G0/G1, G2/M, Gl/S, and S/G2. In an exemplary method, 
the phase is one in which the nuclear membrane of the cell is absent. 

The methods provided herein may be adapted for delivery of large nucleic acid 
1 5 molecules into cells in a variety of environments for a variety of purposes. For example, 

nucleic acid molecules greater than about 0.6, 1, and 5 megabase pairs may be delivered into 
cells using the methods provided herein. 

Included among the nucleic acid molecules that may be delivered into cells using 
alternative embodiments of the methods provided herein are natural chromosomes, artificial 
20 chromosomes, artificial chromosome expression systems (ACes), or fragments of any of 
these chromosomes, such as fragments that are greater than about 0.6 megabase or naked 
DNA that is greater than about 0.6 megabase. 

In particular embodiments of the methods a delivery agent may be used. Delivery 
agents may include a cationic compound. Cationic compounds include, but are not limited 
25 to, a cationic lipid, a cationic polymer, a mixture of cationic lipids, a mixture of cationic 
polymers, a mixture of a cationic lipid and a cationic polymer and a mixture of a cationic 
lipid and a neutral lipid, polycationic lipids, non-liposomal forming lipids, activated 
dendrimers, pyridinium chloride surfactants, ethanolic cationic lipids, and cationic 
amphiphiles. Examples of cationic lipid compounds include N-[l-(2,3-dioleyloxy)propyl]- 
30 N,N,N-trimethylammonium chloride (DOTMA), dioleoylphosphatidylethanolamine (DOPE), 
2,3-dioleyloxy-N-[2(spenrime-carboxamido)emyl]-N,N-dimethyl- 1 - 
propanaminiumtrifluoroacetate (DOSPA), C52H106N6O44CF3CO2H, 
C 88 H m N 8 0 4 S 2 .4CF 3 C0 2 H, C 40 H 8 4NO 3 P.CF 3 CO 2 H, C 5 oHio3N703.4CF 3 C0 2 H, 
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CssHneNgOa.eCFaCOaH, C49H102N6O3.4CF3CO2H, C44H89N5O3.2CF3CO2H, 
CiooH206N 12 04S2.8CF3C02H 3 C4iH 7 8N0 8 P) Ci 6 2H33oN220 9 .13CF3C0 2 H, 
C 4 3H 8 8N402.2CF 3 C02H, C 4 3H88N 4 03.2CF 3 C02H, and (l-methyl-4-(l-octadec-9-enyl- 
nonadec-10-enylenyl) pyridinium chloride. 

5 In particular embodiments of the methods the nucleic acid molecule can be exposed to 

an agent that increases contact between the nucleic acid molecule and the cell and the cell 
may be exposed to an agent that enhances permeability of the cell. The permeability 
enhancing agent may include energy such as ultrasound or electrical energy. 

In alternative embodiments the invention may be utilized to introduce nucleic acids 

1 0 into a wide range of cell types, eukaryotic and prokaryotic, including cell lines, primary cells, 
primary cell lines, plant cells, and animal cells, including embryonic cells, nuclear transfer 
donor cells, stem cells, primary cells, immortalized cells, cells from immortalized cell lines, 
tumor cells, transformed cells, and cells that are capable of the generation of a specific organ. 
Animal cells may include mammalian cells, rodent cells or human cells. Exemplary cells for 

15 use in the methods provided herein include fibroblasts, synoviocytes, and fibroblast-like 
synoviocyte. 

DETAILED DESCRIPTION OF THE INVENTION 
A. DEFINITIONS 

20 Unless defined otherwise, all technical and scientific terms used herein have the same 

meaning as is commonly understood by one of skill in the art to which this invention belongs. 
All patents, patent applications and publications referred to herein are incorporated by 
reference. 

As used herein, "nucleic acid" refers to a polynucleotide containing at least two 
25 covalently linked nucleotide or nucleotide analog subunits. A nucleic acid can be a 

deoxyribonucleic acid (DNA), a ribonucleic acid (RNA), or an analog of DNA or RNA. 
Nucleotide analogs are commercially available and methods of preparing polynucleotides 
containing such nucleotide analogs are known (Lin et al. (1994) Nucl Acids Res. 22:5220- 
5234; Jellinek et al (1995) Biochemistry 34:1 1363-1 1372; Pagratis et al (1997) Nature 
30 Biotechnol 75:68-73). The nucleic acid can be single-stranded, double-stranded, or a 

mixture thereof. For purposes herein, unless specified otherwise, the nucleic acid is double- 
stranded, or it is apparent from the context. 
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The term "nucleic acid" refers to single-stranded and/or double-stranded 
polynucleotides, such as deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), as well 
as analogs or derivatives of either RNA or DNA. Also included in the term ^nucleic acidj) 
are analogs of nucleic acids such as peptide nucleic acid (PNA), phosphorothioate DNA, and 
other such analogs and derivatives. 

As used herein, DNA is meant to include all types and sizes of DNA molecules 
including cDNA, plasmids and DNA including modified nucleotides and nucleotide analogs. 

As used herein, nucleotides include nucleoside mono-, di-, and triphosphates. 
Nucleotides also include modified nucleotides, such as, but are not limited to, 
phosphorothioate nucleotides and deazapurine nucleotides and other nucleotide analogs. 

As used herein, the term "large nucleic acid molecules" or "large nucleic acids" refers 
to a nucleic acid molecule of at least about 0.5 megabase pairs (Mbase) in size, greater than 
about 0.5 Mbase, including nucleic acid molecules at least about 0.6. 0.7, 0.8, 0.9, 1 5 2, 3, 4, 
5, 6, 7, 8, 9, 10, 20 5 30, 40, 50, 60, 70, 87, 90, and 100, 200, 300, or 500 Mbase in size. Large 
nucleic acid molecules can include proteins associated with the molecule. Large nucleic acid 
molecules typically may be on the order of about 10 to about 450 or more Mbase, and may be 
of various sizes, such as, for example, from about 250 to about 400 Mbase, about 150 to 
about 200 Mbase, about 90 to about 120 Mbase, about 60 to about 100 Mbase and about 15 to 
50 Mbase. 

Examples of large nucleic acid molecules include, but are not limited to, natural 
chromosomes and fragments thereof, especially mammalian chromosomes and fragments 
thereof which retain a centromere and telomeres, artificial chromosome expression systems 
(ACes; also called satellite DNA-based artificial chromosomes (SATACs); see U.S. Patent 
Nos. 6,025,155 and 6,077,697), mammalian artificial chromosomes (MACs), plant artificial 
chromosomes, insect artificial chromosomes, avian artificial chromosomes and 
minichromosomes (see, e.g., U.S. Patent Nos. 5,712,134, 5,891,691 and 5,288,625). The 
large nucleic acid molecules may include a single copy of a desired nucleic acid fragment 
encoding a particular nucleotide sequence, such as a gene of interest, or may carry multiple 
copies thereof or multiple genes or different heterologous sequences of nucleotides. For 
example, ACes can carry 40 or even more copies of a gene of interest. Large nucleic acid 
molecules may be associated with proteins. The protein can be any protein, including but not 
limited to, for example, chromosomal proteins, such as those that typically function to 
regulate gene expression and/or participate in determining overall structure and/or proteins 
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that are not typically associated with chromosomes in a cell. Examples of a protein not 
typically associated with a chromosome include, but are not limited to, recombinant proteins 
that have been combined with a nucleic acid, for example a large nucleic acid, within or 
outside of a cell. 

5 As used herein, a "functional fragment" of a chromosome refers to a fragment of a 

chromosome that retains the basic functional attributes of a chromosome. Thus, for example, 
a functional fragment of a chromosome can stably replicate and segregate in a cell in a 
manner similar to a chromosome from which the fragment is derived. A functional fragment 
of a chromosome can be generated in a number of ways. For example, a functional fragment 
10 can be generated by treating a complete, intact chromosome in such a way as to delete a 

portion or portions of the nucleic acid of the chromosome that are not required for the basic 
functioning of the chromosome in a cell. 

As used herein, an artificial chromosome is a nucleic acid molecule that can stably 
replicate and segregate alongside endogenous chromosomes in a cell. It has the capacity to 
15 act as a gene delivery vehicle by accommodating and expressing foreign genes contained 
therein. A mammalian artificial chromosome (MAC) refers to chromosomes that have an 
active mammalian centromere(s). Plant artificial chromosomes, insect artificial 
chromosomes and avian artificial chromosomes refer to chromosomes that include plant, 
insect and avian centromeres, respectively. A human artificial chromosome (HAC) refers to 
20 chromosomes that include human centromeres. For exemplary artificial chromosomes, see, 
e.g., U.S. Patent Nos. 6,025,155; 6,077,697; 5,288,625; 5,712,134; 5,695,967; 5,869,294; 
5,891,691 and 5,721,118 and published International PCT application Nos, WO 97/40183 
and WO 98/08964. An artificial chromosome can include proteins that are associated with 
the nucleic acid molecule. 
25 As used herein, the term "satellite DNA-based artificial chromosome (SATAC)" is 

interchangeable with the term "artificial chromosome expression system (ACes)". These 
artificial chromosomes are substantially all neutral non-coding sequences (heterochromatin) 
except for foreign heterologous, typically gene-encoding nucleic acid, that is interspersed 
within the heterochromatin for the expression therein (see U.S. Patent Nos. 6,025,155 and 
30 6,077,697 and International PCT application No. WO 97/40183). Foreign genes contained 
in these artificial chromosome expression systems can include, but are not limited to, nucleic 
acid that encodes traceable marker proteins (reporter genes), such as fluorescent proteins, 
such as green, blue or red fluorescent proteins (GFP, BFP and RFP, respectively), other 
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reporter genes, such as p-galactosidase and proteins that confer drug resistance, such as a 
gene encoding hygromycin-resistance. Other examples of heterologous DNA include, but 
are not limited to, DNA that encodes therapeutically effective substances, such as anti-cancer 
agents, enzymes and hormones, and DNA that encodes other types of proteins, such as 
antibodies. 

As used herein, the terms "heterologous" and "foreign" with reference to nucleic 
acids, such as DNA and RNA, are used interchangeably and refer to nucleic acid that does 
not occur naturally as part of a genome or cell in which it is present or which is found in a 
location(s) and/or in amounts in a genome or cell that differ from the location(s) and/or 
amounts in which it occurs in nature. It can be nucleic acid that is not endogenous to the cell 
and has been exogenously introduced into the cell. Examples of heterologous DNA include, 
but are not limited to, DNA that encodes a gene product or gene product(s) of interest 
introduced into cells, for example, for purposes of gene therapy, production of transgenic 
animals or for production of an encoded protein. Other examples of heterologous DNA 
include, but are not limited to, DNA that encodes traceable marker proteins, such as a protein 
that confers drug resistance, DNA that encodes therapeutically effective substances, such as 
anti-cancer agents, enzymes and hormones, and DNA that encodes other types of proteins, 
such as antibodies. 

As used herein, "delivery," which is used interchangeably with "transfection," refers 
to the process by which exogenous nucleic acid molecules are transferred into a cell such that 
they are located inside the cell. Delivery of nucleic acids is a distinct process from 
expression of nucleic acids. 

As used herein, "expression" refers to the process by which nucleic acid is translated 
into peptides or is transcribed into RNA, which, for example, may be translated into peptides, 
polypeptides or proteins. If the nucleic acid is derived from genomic DNA, expression may, 
if an appropriate eukaryotic host cell or organism is selected, include splicing of the mRNA. 
For heterologous nucleic acid to be expressed in a host cell, it must initially be delivered into 
the cell and then, once in the cell, ultimately reside in the nucleus. 

As used herein, cell recovery refers to a "total cell yield" after a specified time frame, 
which for purposes herein is twenty-four hours, and when used with reference to calculation 
of the clonal fraction. 

As used herein, cell recovery time refers to a time frame in order for a cell to 

equilibrate to new conditions. 
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As used herein, cell survival refers to cell viability after a cytotoxic event, such as a 
delivery procedure. 

As used herein, control plating efficiency (CPE) refers to the fraction of untreated 
cells, under standard optimal growth conditions for the particular cells, that survive a plating 
procedure. Plating efficiency refers to the fraction of treated cells that survive a plating 
procedure. 

As used herein, clonal fraction is a measurement of cell recovery after delivery of 
exogenous nucleic acids into cells and the plating efficiency of the cells. 

As used herein, transfer or delivery efficiency is the percentage of the total number of 
cells to which nucleic acids are delivered that contain delivered nucleic acid. 

As used herein, transfection efficiency is the percentage of the total number of cells to 
which nucleic acids including a selectable marker are delivered that survive selection and/or 
express the selectable marker. 

As used herein, index of potential transfection efficiency means the theoretical 
maximum transfection efficiency for a particular cell type under particular conditions, for 
example particular concentrations or amounts of particular delivery agents. 

As used herein, the term "cell" is meant to include cells of all types, of eukaryotes and 
prokaryotes, including animals and plants. 

As used herein, the term "cell cycle" includes the reproductive cycles through which a 
cell grows and divides to yield two or more daughter cells of the same or different ploidy. 
The cell cycle can be viewed as having several distinct phases, including, for example, in a 
eukaryotic organism, the Gl, S, G2 and M phases. The M phase is the mitotic phase which 
includes segregation of the pairs of chromosomes, the production of two separate nuclei and 
division of the cell into two daughter cells. The M phase can be further divided into 
prophase, prometaphase, metaphase, anaphase, telophase and cytokinesis. In meiosis, similar 
phases occur in both the first and second cell divisions that typically yield four daughter cells. 
Typically, the Gl, S and G2 phases are referred to as forming a period known as interphase 
during which the cell grows and progresses toward mitosis. During the S phase, DNA 
synthesis can occur in preparation for cell division. 

As used herein, "phase" with reference to a cell refers to a state of a cell relative to the 
cell cycle. Thus, for example, the phase of a cell can be a phase within the active cell cycle 
(e.g., Gl, S, G2 and M) or a phase out of the cell cycle, such as the resting phase referred to 
as GO in which the cell is quiescent and not actively passing through the cell cycle. 

8 
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As used herein, "cell cycle arrest agent" refers to any means chemical, physical or 
others used to stop a cell from passing a set point or points in the growth cycle of a cell. 

As used herein, "pre-selected phase" refers to a phase or point within a phase of a cell 
that specifically has been selected. The selection of a particular phase or point within a phase 

5 of a cell can be based on a determination of a characteristic of that point or phase. Thus, for 
example, the selection of a phase or point within a phase of a cell may be based on a 
determination that cells at that point or phase provide for improved transfer of a nucleic acid, 
particularly a large nucleic acid, into the cells relative to the transfer of the nucleic acid into 
cells at a different point or phase. Improved transfer can be, for example, any qualitative 

10 and/or quantitative improvement in any aspect of process and/or result of the transfer of the 
nucleic acid. Thus, improved transfer can be, for example, an increase in the efficiency of 
delivery and/or transfection of the nucleic acid into the cell, an increase in expression of the 
delivered nucleic acid into the cell and/or an increase in the integrity or intactness of the 
delivered nucleic acid in the cell. 

1 5 As used herein, "conditions that permit cell cycling" refers to conditions that allow for 

growth of cells. 

As used herein, a "synchronous population" with reference to a population of cells is 
one in which more cells in the population are in the same single phase than in an equivalent 
population of cells that has not been synchronized. Synchronization is effected by exposing a 

20 population of cells to conditions that result in an increase in the number of cells in the 

population that are in the same single phase. Such conditions are known by those of skill in 
the art and include but are not limited to natural and artificial treatments, such as, for 
example, changing any internal or external parameter of the cells, arresting cycling of the 
cells and releasing the cells from arrest, and physical separation of cells that are in different 

25 phases. An asynchronous population of cells is one which has not been exposed to any 
conditions that render the cells synchronous. 

As used herein, "delivery agent" refers to compositions, conditions or physical 
treatments to which cells and/or nucleic acids may be exposed in the process of transferring 
nucleic acids to cells in order to facilitate nucleic acid delivery into cells. Delivery agents 

30 include compositions, conditions and physical treatments that enhance contact of nucleic 

acids with cells and/or increase the permeability of cells to nucleic acids. In general, nucleic 
acids are not directly treated with energy, such as sonoporation. 
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As used herein, cationic compounds are compounds that have polar groups that are 
positively charged at or around physiological pH. These compounds facilitate delivery of 
nucleic acid molecules into cells; it is thought this is achieved by virtue of their ability to 
neutralize the electrical charge of nucleic acids. Exemplary cationic compounds include, but 
are not limited to, cationic lipids or cationic polymers or mixtures thereof, with or without 
neutral lipids, polycationic lipids, non-liposomal forming lipids, ethanolic cationic lipids and 
cationic amphiphiles. Contemplated cationic compounds also include activated dendrimers, 
which are spherical cationic polyamidoamine polymers with a defined spherical architecture 
of charged amino groups which branch from a central core and which can interact with the 
negatively charged phosphate groups of nucleic acids (e.g., starburst dendrimers). 

Cationic compounds for use as delivery agents also include mixtures of cationic 
compounds that include peptides and protein fragments. The additional components may be 
non-covalently or covalently bound to the cationic compound or otherwise associated with 
the cationic compound. 

As used herein, ultrasound energy is meant to include sound waves (for external 
application) and lithotripter-generated shock waves (for internal application). 

As used herein, electrical energy is meant to include the application of electric fields 
to cells so as to open pores in membranes for the delivery of molecules into the cell, e.g., 
electroporation techniques. 

As used herein, cavitation compound is meant to include contrast agents that are 
typically used with ultrasound imaging devices and includes gas encapsulated and 
nongaseous agents. These cavitation compounds enhance the efficiency of energy delivery of 
acoustic or shock waves. 

As used herein, "pharmaceutical^ acceptable" refers to compounds, compositions 
and dosage forms that are suitable for administration to the subject without causing excessive 
toxicity, irritation, allergic response or other undesirable complication. 

As used herein, embryonic stem cells are primitive, immature cells that are precursors 
to stem cells. 

As used herein, stem cells are primitive, immature cells that are precursors to mature, 
tissue specific cells. 

As used herein, nuclear transfer donor cells are cells that are the source of nuclei, 
which are transferred to enucleated oocytes during the process of nuclear transfer. 
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As used herein, the term "subject" refers to animals, plants, insects, and birds into 
which the large DNA molecules may be introduced. Included are higher organisms, such as 
mammals and birds, including humans, primates, rodents, cattle, pigs, rabbits, goats, sheep, 
mice, rats, guinea pigs, cats, dogs, horses, chicken and others. 

As used herein, "administering to a subject" is a procedure by which one or more 
delivery agents and/or large nucleic acid molecules, together or separately, are introduced 
into or applied onto a subject such that target cells which are present in the subject are 
eventually contacted with the agent and/or the large nucleic acid molecules. 

As used herein, "applying to a subject" is a procedure by which target cells present in 
the subject are eventually contacted with energy such as ultrasound or electrical energy. 
Application is by any process by which energy may be applied. 

As used herein, gene therapy involves the transfer or insertion of nucleic acid 
molecules, and, in particular, large nucleic acid molecules, into certain cells, which are also 
referred to as target cells, to produce specific gene products that are involved in correcting or 
modulating diseases or disorders. The nucleic acid is introduced into the selected target cells 
in a manner such that the nucleic acid is expressed and a product encoded thereby is 
produced. Alternatively, the nucleic acid may in some manner mediate expression of DNA 
that encodes a therapeutic product. This product may be a therapeutic compound, which is 
produced in therapeutically effective amounts or at a therapeutically useful time. It may also 
encode a product, such as a peptide or RNA, that in some manner mediates, directly or 
indirectly, expression of a therapeutic product. Expression of the nucleic acid by the target 
cells within an organism afflicted with a disease or disorder thereby provides a way to 
modulate the disease or disorder. The nucleic acid encoding the therapeutic product may be 
modified prior to introduction into the cells of the afflicted host in order to enhance or 
otherwise alter the product or expression thereof. 

For use in gene therapy, cells can be transfected in vitro, followed by introduction of 
the transfected cells into the body of a subject. This is often referred to as ex vivo gene 
therapy. Alternatively, the cells can be transfected directly in vivo within the body of a 
subject. 

As used herein, flow cytometry refers to processes that use a laser based instrument 
capable of analyzing and sorting out cells and or chromosomes based on size and 
fluorescence. 
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As used herein, a reporter gene includes any gene that expresses a detectable gene 
product, which may be RNA or protein. Preferred reporter genes are those that are readily 
detectable. Examples of reporter genes include, but are not limited to nucleic acid encoding a 
fluorescent protein, CAT (chloramphenicol acetyl transferase) (Alton and Vapnek (1979), 
5 Nature 282: 864-869) luciferase, and other enzyme detection systems, such as beta- 

galactosidase; firefly luciferase (deWet et al (1987), Mol Cell Biol 7: 725-737); bacterial 
luciferase (Engebrecht and Silverman (1984), PNAS 1: 4154-4158; Baldwin et al (1984), 
Biochemistry 23: 3663-3667); and alkaline phosphatase (Toh et al (1989) Eur. J. Biochem. 
182: 231-238, HsUetal (1983) J. Mol Appl Gen. 2: 101). 

10 As used herein, a reporter gene construct is a DNA molecule that includes a reporter 

gene operatively linked to a transcriptional control sequence. The transcriptional control 
sequences include a promoter and other optional regulatory regions, such as enhancer 
sequences, that modulate the activity of the promoter, or control sequences that modulate the 
activity or efficiency of the RNA polymerase that recognizes the promoter, or control 

1 5 sequences that are recognized by effector molecules, including those that are specifically 
induced by interaction of an extracellular signal with a cell surface protein. For example, 
modulation of the activity of the promoter may be effected by altering the RNA polymerase 
binding to the promoter region, or, alternatively, by interfering with initiation of transcription 
or elongation of the mRNA. Such sequences are herein collectively referred to as 

20 transcriptional control elements or sequences. In addition, the construct can include 

sequences of nucleotides that alter translation of the resulting mRNA, thereby altering the 
amount of reporter gene product. 

As used herein, promoter refers to the region of DNA that is upstream with respect to 
the direction of transcription of the transcription initiation site. It includes the RNA 

25 polymerase binding and transcription imitation sites and any other regions, including, but not 
limited to repressor or activator protein binding sites, calcium or cAMP responsive sites, and 
any such sequences of nucleotides known to those of skill in the art to alter the amount of 
transcription from the promoter, either directly or indirectly. 

As used herein, a promoter that is regulated or mediated by the activity of a cell 

30 surface protein is a promoter whose activity changes when a cell is exposed to a particular 
extracellular signal by virtue of the presence of cell surface proteins whose activities are 
affected by the extracellular protein. 
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B. CELL CYCLE 

1. Cell Cycle Phases 

In the reproductive cycle of a cell, the cell undergoes specific phases of DNA 
synthesis and division over a defined period of time. In a somatic mammalian cell, this time 
5 course is conventionally divided into four phases: Gl (first gap) phase, S (DNA synthesis) 
phase, G2 (second gap) phase, and M (mitotic) phase. Ninety percent or more of the total 
cell-cycle time course is dominated by the Gl, S and G2 phases, collectively termed 
interphase. The remaining time is devoted to the M phase, during which the cell actually 
divides. For example, in rapidly proliferating cells of higher eukaryotes, M phases generally 

1 0 occur only once every 1 6 to 24 hours, and each M phase itself lasts only 1 to 2 hours. Cells 
in different tissues, in different species, and at different stages of embryonic development 
have division cycles that vary greatly in duration, from less than an hour (for example, in the 
early frog embryo) to more than a year (for example, in the adult human liver). Although all 
phases of the cell cycle vary to some extent, by far the greatest variation occurs in the 

1 5 duration of Gl , which may be, in some cases, practically non-existant, or for such a length of 
time that the cell appears to have altogether ceased progressing through the divisional cycle 
and to have withdrawn into a quiescent state. Cells in such a quiescent Gl state are often said 
to be in the GO state. 

During the M phase or mitosis, the phase of cell division, the cell transitions through 
20 four substages: prophase, metaphase, anaphase, and telophase. In meiosis, similar substages 
occur in each of the two cellular division cycles that typically yield four daughter cells. 
Prophase involves the condensation of chromosomes and breakdown of nuclear membrane 
such that the nuclear contents are no longer separated from the cytoplasm. Metaphase is 
characterized by the migration of chromosomes to the equatorial plane of the cell. In 
25 anaphase, the chromosomes migrate to opposite poles of the cell segregating half sets of 

chromosomes to the two ends of the dividing cell. In late anaphase, constriction through the 
equator of the cell begins and results in the pinching off of two daughter cells in telophase. 
The daughter cells then begin interphase of a new cycle. 

Meiosis typically comprises two successive nuclear divisions with only one round of 
30 DNA replication. Four stages can be described for each nuclear division, similar to the stages 
of mitosis. The first division of meiosis may be divided into the following substages: 
prophase 1, when each chromosome dupicates and remains closely associated (called sister 
chromatids); metaphase 1, when homologous chromosomes align at the equatorial plate; 



WO 03/093469 



PCT/CA03/00645 



anaphase 1, when homologous pairs separate with sister chromatids remaining together; and, 
telophase 1, when two daughter cells are formed with each daughter containing only one 
chromosome of the homologous pair. Similarly, The second division of meiosis may be 
divided into the following substagesiprophase 2, when DNA does not replicate; metaphase 2, 
when chromosomes align at the equatorial plate; anaphase 2, when centromeres divide and 
sister chromatids migrate separately to each pole; and, telophase 2, when cell division is 
complete. The following detailed discussion relates largely to the mechanics of mitotic 
division, although it applies mutatis mutandis to cells undergoing meiosis. 

Interphase starts with the Gl phase, in which the biosynthetic activities of the cells, 
which proceed very slowly during mitosis, resume at a high rate. During Gl, the cell is 
subject to stimulation by extracellular mitogens and growth factors. In response to these 
stimuli, the cell passes through Gl and proceeds with DNA synthesis in S phase. The S 
phase begins when DNA synthesis starts, and ends when the DNA content of the nucleus has 
doubled and the chromosomes have replicated. The cell then enters the G2 phase, which 
continues until mitosis starts, initiating the M phase. 

2. Cell Cycle Control 

The regulation of the cell-cycle must ensure that the events in each phase are 
complete before moving to the next. To ensure completion of each phase, checkpoints for 
monitoring the integrity of DNA are strategically placed in Gl and at the G2/M interface to 
prevent progression and propagation of mutated or damaged cells. The Gl checkpoint can be 
described as the transition from early Gl, when cells have the option of withdrawing from the 
cell cycle into the GO state, and late Gl, when cells are committed to another round of 
genome replication and division. In mammalian cells, this transition is .also referred to as the 
"restriction point". In response to growth and/or mitotic signals, the cell traverses the 
restriction point and commits to proceeding through the cell cycle. In the absence of such 
signals, the cell may undergo differentiation, apoptosis, or enter the quiescent state (GO). 
Once committed, the cell cycle is propelled by a series of cyclin/cyclin dependent protein 
kinase (CDK) complexes. Thus, the transition between cell cycle stages is delineated by the 
synthesis and subsequent proteolytic degradation of various cyclins. 

For example, the mammalian cycle begins in Gl with increased expression of the D 
cyclins (Dl, D2, D3). D cyclin activity has been linked to cell size and external growth- 
regulatory signals coupling cell size with entry into a new cell cycle. The D cyclins associate 
with CDK4 and CDK6 resulting in phosphorylation and activation of the CDKs. Activated 
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CDK4/6 mediates a signal cascade resulting in the phosphorylation of the retinoblastoma 
protein and subsequent release of E2F transcription proteins and transcription of responder 
genes (including cyclin E). The responder proteins are required for Gl progression through 
the restriction point. 

5 As the cell progresses through late Gl, Cyclin E/CDK2 facilitates processes 

associated with the transition from Gl into S, primarily by shutting down the various braking 
systems that suppress S phase CDK activity in Gl. Cyclin E/CDK2 also initiates other early 
cell-cycle events, such as the duplication of the centrosome. Increased expression of cyclin A 
begins at the Gl/S transition and continues through S phase. Cyclin A binding to CDK2 

10 stimulates DNA synthesis. In late S phase cyclin A binds CDK1 . As the cell approaches M 
phase, cyclin B expression increases and peaks at metaphase. Cyclin B/CDK1 along with 
Cyclin A/CDK1 activity propels the cell through mitosis. The degradation of these cyclins 
result in exit from mitosis and cytokinesis. 

3. Chromosome Replication and Control 

1 5 The synthesis of genomic DNA in S phase is only one component of the chromosomal 

replication process. A number of important processes throughout the cell cycle and 
specifically cell division are coupled to the DNA replication process and the proteins and 
factors which drive this process. In addition, many of the proteins and factors that are crucial 
for progression through the various cell cycle phases may be useful targets for cell arrest 

20 agents. 

In eukaryotic cells, the DNA molecules of the chromosomes begin replication or 
synthesis at multiple start points or replication origins distributed along the individual 
chromosomes. Synthesis begins bidirectionally at these points and proceeds until the 
replication fork of adjacent start points meet. Although replication origins can vary greatly 

25 depending on cell type, they can be determined, in part, by the binding of the origin 

recognition complex (ORC), which has been conserved in evolution from yeast to humans. In 
most cells, one or more subunits of the ORC remain bound to origins through most of the cell 
cycle. For example, in multicellular eukaryotes, some ORC subunits remain bound 
throughout Gl, S and G2 phases. Early in the Gl phase of the cell cycle, proteins required 

30 for DNA synthesis, including Cdc6, Cdtl and six related proteins known as the MCM2-7 

complex, are recruited to the ORC to form the prereplicative complex (pre-RC). By blocking 
the recruitment of such proteins, the cell can be arrested in G1/G0 phase. In late Gl phase, 
the pre-RC matures into a pre-initiation complex (pre-IC) by recruitment of additional factors 
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including Cdc45, Sld3 and other factors involved in initiation and elongation during DNA 
synthesis. 

At the end of Gl, when the initiation complex at each replication origin is complete, a 
specific cyclin\CDK complex triggers the initiation of DNA synthesis and the S phase. In 

5 mammalian cells, this complex is cyclin A/CDK2. Another essential S phase promoting 
kinase is Cdc7 which interacts directly with the replication origin and/or ORC complex. 
After CDK and Cdc7 activation, origins are unwound and the heterotrimeric single-stranded 
DNA binding protein RPA is recruited. After early unwinding in early S phase, DNA 
polymerase a-primase complex is recruited to facilitate early synthesis. After initiation of 

10 early synthesis, additional DNA polymerases and auxiliary factors such as PCNA 

(proliferating cell nuclear antigen) are recruited to complete synthesis of each parental strand. 
As chromosomal DNA unwinds during synthesis, the topology of the DNA changes. 
Unwinding generates positive supercoils ahead of the replication fork and precatenanes in the 
newly replicated DNA behind the fork. The topology continues to change through mitosis. 

1 5 Agents which block the action or recruitment of required DNA synthesis machinery, such as, 
for example, DNA polymerase, ribonucleotide reductase or PCNA, will arrest the cell in S 
phase. PCNA also plays a central role in coupling many other cell cycle processes to the 
replication fork via direct protein protein interactions such as chromatid cohesion in late S 
phase and/or G2 phase. DNA damage and/or stalled replication forks trigger a global 

20 genome integrity checkpoint and recruitment of a number of kinases and other factors. 

Activation of this checkpoint prevents the firing of additional replication origins and prevents 
entry into mitosis, thus agents that target this checkpoint may be used to arrest cells in S 
and/or G2 phase. 

In G2 phase, after DNA synthesis, the two daughter DNA molecules remain tightly 
25 associated with each other in a process known as sister chromatid cohesion. Cohesion is 
believed to occur discontinuously across the chromatids at specific and numerous cohesion 
sites, including centromeres. In the G2 phase, cohesion may help facilitate post-replicative 
repair of any double stranded breaks. Cohesion is also important for events occurring in M 
phase. For example, after assembly of a bipolar mitotic spindle in metaphase, controlled 
30 proteolysis of cohesions allows the two sister chromatids to be pulled to opposite poles and 
initiate anaphase. Attachment of the chromatids to microtubules (which form the mitotic 
spindle) from opposite poles by kinetochores is a crucial step in the segregation process and, 
therefore, factors which facilitate this process are useful targets for M phase cell arrest agents. 
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Chromosome segregation coincides with cyclin B degradation which drives cytokinesis and 
mitotic exit. 

C. CELL PHASE-BASED DELIVERY OF NUCLEIC ACID INTO CELLS 

Provided herein are methods of delivering nucleic acids into cells in which the 
5 recipient cell is selected to be in a particular phase. The methods are particularly well suited 
for the delivery of large nucleic acids, including, for example, chromosomes and fragments 
thereof, and artificial chromosomes into cells. The methods utilize host cells that are in a 
phase that provide for transfer, and in particular improved transfer, of intact large nucleic 
acids into the cells. The phases or cycles of the cells in a host cell population that may be 
10 used in the methods can be synchronized to provide for an increased number of the cells that 
are in the same single phase for delivery of large nucleic acids into the cells. Methods for 
determining a cell phase for delivery of large nucleic acids into cells and methods for 
obtaining a cell or plurality of cells in a particular phase of the cell are also described herein. 
1. Cells 

1 5 The methods provided herein can be used in the delivery of nucleic acids into any 

cells, including, but not limited to, any eukaryotic and prokaryotic cells. Examples of cells 
that can be used in the methods include, but are not limited to, cell lines, primary cells, 
primary cell lines, plant cells and animal cells, including stem cells and embryonic cells. For 
example, fibroblasts, including lung and skin fibroblasts, fibroblast-like cells, synoviocytes, 

20 fibroblast-like synoviocytes, stem cells, including embryonic and adult stem cells, such as 
mesenchymal stem cells, myoblasts, lymphoblasts, carcinoma and hepatoma cells are among 
the many cells into which nucleic acids, and in particular large nucleic acids and artificial 
chromosomes, can be delivered and monitored using the methods provided herein. Particular 
cells include mammalian cells, for example, A9 cells (mouse fibroblasts, HPRT-; ATCC 

25 Accession no. CCL-1.4), CHO-S cells and DG44 cells (Chinese hamster ovary cells), V79 
cells (Chinese hamster lung fibroblasts; ATCC Accession no. CCL-39), LMTK-cells (mouse 
fibroblasts; ATCC Accession No. CCL-1.3), skin fibroblasts (including primary human 
foreskin fibroblasts), L8 cells (rat myoblasts; ATCC Accession No. CRL-1769), CCD1043 
SK cells (human fibroblasts; ATCC Accession No. CRL-2056), adult-derived mesenchymal 

30 stem cells (e.g., derived from human bone marrow; Carnbrex Biosciences, East Rutherford, 
New Jersey), synoviocytes (rat and human), Detroit 551 cells (human embryonic skin 
fibroblasts; ATCC Accession No. CCL-1 10), NS0 (murine myeloma, ECACC Accession No. 
851 10503), 293 cells (human embryonic kidney cells transformed by type 5 (Ad 5) DNA 
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(ATCC Accession No. CRL-1573), P46-FI (bovine lymphocyte-like cell line), DT40 
(chicken lymphoblasts), EJ30 cells (human bladder carcinoma), HepG2 cells (human 
hepatoma) and murine and bovine embryos. 

In particular embodiments, the methods of delivery of nucleic acids into cells 

5 provided herein can be used in delivering nucleic acids into cells in order to treat a disease or 
disorder, e.g., in gene therapy applications. In gene therapy applications, the nucleic acid to 
be delivered into a cell may encode a therapeutic molecule, e.g., a protein. In a particular 
example, the protein may be Factor VIE. In many instances, successful gene therapy 
applications are complicated by a requirement that large nucleic acids be delivered into cells. 

1 0 It may also be desired to provide multiple copies of nucleic acid encoding one or more 
therapeutic molecules. Compounding the difficulties in gene therapy methods is the 
challenge that cells preferred for use in gene therapy applications are often not readily 
transferable. The methods provided herein are particularly well suited for delivery of large 
nucleic acids, which may be in the form of artificial chromosomes or fragments thereof, into 

1 5 cells as may be used in therapeutic applications. 
2. Nucleic acids 

Although any nucleic acid may be used in the cell phase-based methods described 
herein, the methods are particularly well suited for delivery of large nucleic acids into cells. 
In particular embodiments, the large nucleic acids are chromosomes, fragments thereof and 

20 artificial chromosomes and any other large and/or complex nucleic acid. In further particular 
embodiments, the large nucleic acid is associated with one or more proteins through any type 
of binding or other interaction. The methods described herein provide for improved transfer 
of such nucleic acids into cells. For example, the cell phase-based methods described herein 
may provide for delivery of large nucleic acid molecules into cells with greater transfection 

25 efficiency and greater integrity of the molecules than methods that do not include use of a cell 
that is in a pre-selected phase. The efficiency of transfection of large nucleic acid molecules, 
and in particular chromosomes, into cells using these cell phase-based methods is at least 1.5, 
at least 2, at least 2.5, at least 3, at least 3.5, at least 4, at least 4.5, at least 5, at least 5.5, at 
least 6, at least 6.5, at least 7, at least 7.5, at least 8, at least 8.5, at least 9, at least 9.5 or at 

30 least 10-fold greater than the efficiency of transfection of the nucleic acid molecule to cells in 
an asynchronous population of cells. 

In particular embodiments, the methods provide for the delivery of chromosomes, 
fragments thereof or artificial chromosomes into cells. Artificial chromosomes include 
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Artificial Chromosome Expression System (ACes; also called satellite DNA-based artificial 
chromosomes (SATACs)) chromosomes, which are described, for example, in U.S. Patent 
Nos. 6,025,155 and 6,077,697 and PCT International Patent Application Publication No. WO 
97/40183, mammalian artificial chromosomes (MACs), plant artificial chromosomes, insect 
5 artificial chromosomes, avian artificial chromosomes and minichromosomes (see, e.g., U.S. 
Patent Nos. 5,712,134, 5,891,691 and 5,288,625). For other exemplary artificial 
chromosomes, see, e.g., U.S. Patent Nos. 5,695,967; 5,869,294; and 5,721,118 and published 
International PCT application No. WO 98/08964. 

In methods for delivering chromosomes, functional fragments thereof or artificial 

10 chromosomes into cells as provided herein, the phase of the chromosome can be a 

consideration. During the cell cycle, chromosomes undergo morphological and other 
changes. For example, the topology of chromosomal DNA changes as it is unwound during 
DNA replication in the S phase of the cell cycle. Unwinding generates positive supercoils 
ahead of the replication fork and precatenanes in the newly replicated DNA behind the fork. 

1 5 During mitosis, chromosomes undergo condensation and decondensation with accompanying 
changes in appearance detectable through microscopy. Typically, chromosomes begin to 
condense during prophase and to decondense during telophase of mitosis. Thus, the 
condition of a chromosome differs at different phases of the cell cycle. 

The condition of a chromosome may be a consideration first in isolating a 

20 chromosome from a cell and handling of the chromosome for subsequent transfer into a 

particular recipient cell. For example, condensation level may be a significant factor in the 
isolation and handling of chromosomes. Furthermore, the condition of a chromosome may 
impact its retention in a cell into which it is delivered. In particular methods provided herein 
for delivery of chromosomes or fragments thereof into cells, the phase of the chromosome or 

25 fragment thereof is the same or simliar to that of the recipient cell at the time of delivery. In 
further embodiments, the phase of the chromosome and of the recipient cell is the M phase, 
and, in particular, metaphase. 

3. Determining cell cycle phase to be used in a method for delivering nucleic 
acid to a cell 

30 In methods provided herein for delivering nucleic acids to cells, a recipient cell is in a 

pre-selected phase. The pre-selected phase can be one that can provide for improved transfer 
of a nucleic acid, such as a large nucleic acid, into the cell. The pre-selected phase can differ 
for different types of recipient cells and for different types of nucleic acids and can be 
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determined as described herein and using procedures known in the art. Improved transfer can 
take into account a number of factors. For example, delivery efficiency, transfection 
efficiency, integrity of the transferred molecules and cell viability after delivery are major 
considerations in determining improved transfer. Thus, for example, improved transfer may 
5 be an increased delivery or transfection efficiency attainable while still retaining intact 
transferred nucleic acids in a certain percentage of the recipient cells. 

(a) Assessing cell phase and chromosome phase 
One way to determine the phase of the cell cycle that a recipient cell should be in for 
delivery of the large nucleic acid into the cell involves determining and comparing the 
10 efficiencies of delivery and/or transfection, or comparing any other factor of relevance in 

achieving improved transfer, of nucleic acid into the cell in different phases of the cell cycle 
and selecting the cycle in which improved transfer of the nucleic acid is obtained. In order to 
compare the delivery and/or transfection efficiencies, or any other relevant parameter of 
transfer, the cell phases of the recipient cells are determined and the nucleic acid is contacted 
1 5 . with the cells of the different phases. 

Methods of assessing the phase of a cell or population of cells are described herein 
and are known in the art [see, e.g., Darzynkiewicz et al (2001) Seminars Hematol 55:179- 
193 and Nunez (2001) Curr, Issues Mol Biol J:67-70]. Cell phase may be assessed in a 
variety of ways including cytometric analysis, microscopic analysis, gradient centrifugation, 
20 elutriation and fluorescence techniques including immunofluorescence (which may be used in 
combination with, for example, any of the preceding techniques). Cytometric techniques 
include exposing the cell to an agent or stain, such as DNA-binding dyes, e.g., propidium 
iodide (PI), and analyzing cellular DNA content by flow cytometry. Immunofluorescence 
techniques include detection of specific cell cycle indicators such as, for example, PCNA, 
25 thymidine analogs and cyclins, with fluorescent antibodies. 

(i) Cytometric/Immunocytochemical methods 
Cell phase analysis by cytometry can be used to identify the distribution of cells in 
particular phases of the cell cycle, to determine the kinetics of progression through the phases 
and to monitor the molecular and functional mechanisms associated with the cell cycle, for 
30 example by immunohistochemical detection of components such as cyclins, inhibitors of 

cyclin-dependent kinases, cell cycle-associated protooncogenes and tumor suppressor genes. 
In one method of cell phase analysis using flow cytometry, the nuclear DNA content of a cell 
can be quantitatively measured at high speed as an indicator of cell cycle phase. DNA 
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content is a marker of cell phase because the DNA content of a cell changes between the 
several phases of the cell cycle. Cells in GO/1 phase have DNA content set equal to 1 unit of 
DNA; cells in S phase duplicate DNA, increasing its content in proportion to progression 
through S; and upon entering G2 and then M phases, cells have twice the GO/1 phase DNA 
5 content (i.e., 2 units of DNA). Thus, S phase cells have a DNA content that is intermediate 
between that of cells in Gl and G2/M (which have twice as much DNA as cells in Gl). 
•Univariate analysis of cellular DNA content allows discrimination of GO/1, S and G2/M 
phase cells. 

Flow cytometry measurement of cellular DNA content typically involves addition of a 

10 dye that binds stoichiometrically to DNA to a suspension of permeabilized cells or nuclei. 
Generally, cells are fixed or permeabilized, e.g., with a detergent, and then stained with a 
DNA-binding dye. Examples of such dyes include, but are not limited to, a nucleic acid- 
specific fluorochrome, propidium iodide (PI) or 4 '6 '-diamidino-2-phenylindole (DAPI). PI 
stains RNA in addition to DNA; thus, to avoid inclusion of measurement of fluorescence due 

15 to RNA in determining DNA content of a cell, it may be necessary to remove RNA by 

incubation with RNase. The DNA-bound PI emits red fluorescence when excited with blue 
light (488 ran). The DAPI-DNA complex can be excited by UV light (360 nm) and emits 
blue fluorescence. DNA can also be stained in live cells with the UV light-excitable 
fluorochrome Hoeschst 33242 which also emits blue fluorescence. Other DNA-binding dyes 

20 include, but are not limited to, Hoechst 33258, 7-AAD, LDS 751, and SYTO 16 (see, e.g., 

Molecular Probes Handbook of Fluorescent Probes and Research Chemicals, Haugland, Sixth 
Ed.; chapters 8 and 16 in particular). 

In general, the DNA-binding dye is contacted with with a cell (e.g., in concentrations 
ranging from about 1 ng/ml to about 5 (Xg/rnl) and taken up passively by the cell. The dye is 

25 allowed to incubate for some period of time which can depend in part on the particular dye 
being used and can be determined empirically. Once inside the cell, the dye binds to DNA, 
for example by intercalation, although in some cases the dye can be a major or minor groove 
binding compound). 

The stained material incorporates an amount of dye proportional to the amount of 
30 DNA. The stained material is then measured in the flow cytometer and the emitted 

fluorescent signal yields an electronic pulse with a height (amplitude) proportional to the total 
fluorescence emission from the cell. The results of fluorescence measurements can also be 
displayed as cellular DNA content frequency histograms which show the proportions of cells 
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in the various phases of the cycle based on differences in fluorescence intensity. Software 
containing mathematical models that fit the DNA histogram of a singlet have been developed 
to calculate the percentages of cells occupying the different phases of the cell cycle. Several 
manufacturers provide software for cell cycle analysis, including, for example, Becton 
Dickinson (CellFit™). 

Frequency histograms do not provide information about the rate of cell cycle 
progression or cell kinetics. However, the duration of a phase of the cell cycle (S;T S ) can be 
estimated if the duration of the cell cycle (T c ) or the doubling time of cells in culture is 
known. For example, the following equation may be used to estimate the duration of the S 
phase: T s /T c = In(F s + l)/ln2, where T s is the duration of the S phase, T c is duration of the 
cell cycle (or cell doubling timne in culture) and F s is a fraction of the S phase cells estimated 
from the DNA content frequency histogram. This equation applies to cells growing 
exponentially, when both daughter cells subsequently divide. 

Univariate DNA content analysis also cannot be used to distinguish noncycling, 
quiescent GO cells (e.g., normal quiescent cells that have not been stimulated by a mitogenic 
signal to enter the cell cycle and stem cells that can remain in GO for long periods of time) 
because GO cells have the same DNA content as cycling Gl cells. One way to distinguish 
cells in GO and Gl phases is based on differing RNA contents of cells in these phases. GO 
cells have a low RNA content reflecting a low number of ribosomes in the cells. 
Furthermore, cells in early and late stages of the Gl phase contain differing amounts of RNA. 
Cells in early Gl or Gl A, i.e., Gl cells in the growth phase, accumulate RNA and proteins 
but have an amount of RNA that is less than a threshold amount required in order for a cell to 
enter S phase. Thus, cells in the early Gl phase are referred to as a sub-threshold 
subpopulation. Cells in late Gl or GIB, which can enter the S phase without additional RNA 
or protein accumulation, have supra-threshold amounts of RNA. 

Bivariate cytometric analysis of DNA versus RNA content of cells allows 
discrimination between Gl and GO cells, as well as between Gl A and GIB cells, based on 
differences in RNA content. One method for differentially staining DNA and RNA exploits 
the metachromatic property of the cationic fluorochrome acridine orange (AO), which is 
excitable by blue light. Acridine orange interacts with DNA under specific staining 
conditions (proper concentration, ionic strength and pH) such that the DNA-AO intercalative 
complexes in the cell fluoresce green (530 nm). Under the same conditions, the RNA-AO 
complexes emit re (>640 nm) fluorescence, Acridine orange staining thus differentially 

22 



WO 03/093469 



PCT/CA03/00645 



stains DNA and RNA (green vs. red, respectively), and can be used in conjunction with 
analysis by flow cytometers having a single-laser (488 nm) excitation. If a two-laser 
excitation cytometer is used in the analysis of cells for varying RNA content, DNA and RNA 
can be distinguished by differential staining with a combination of Hoechst 33242 and 
5 pyronin Y [see, e.g., Shapiro (1998) Cytometry 2:143-150]. 

Another way to distinguish noncycling and cycling cells is based on differential 
expression of numerous proteins, such as, for example, proliferation-associated proteins. 
Antibodies against these marker proteins can be used for immunocytochemical identification 
of noncycling and cycling cells. For example, the antibody Ki-67 detects a protein that is 

10 ubiquitous in cycling but frequently absent in noncycling cells [see, e.g., Endl et al (2001) 
Meth Cell Biol 63:399-41 8]. The expression of protein reactive with Ki-67 varies during the 
cell cycle such that it is minimal in Gl cells and increases rapidly during S and G2 phases to 
reach a maximal amount in M cells. Proliferating cell nuclear antigen (PCNA), which is a 
co-factor of DNA polymerase A, is another proliferation-associated marker protein in cells. 

15 PCNA is specifically expressed in S-phase nuclei and is in the cytosol during Gl and G2/M 
[see, e.g., Larsen et al (2001) Meth Cell Biol 63:419-432]. Cytometric analysis of an 
immunocytochemically detectable marker protein can be used in conjunction with DNA 
content analysis to reveal the cell cycle phase specificity of marker expression. 

Univariate DNA content analysis also cannot be used to distinguish cells in G2 and M 

20 phases of the cell cycle, which have the same DNA content. There are, however, numerous 
markers for mitotic cells that have been adapted to cytometric methods that can be used to 
distinguish such cells from G2 phase cells. For example, histone H3 phosphorylation, which 
varies throughout the cell cycle, is a useful marker for mitotic cells. From prophase until 
telophase nearly all histone H3 molecules in chromatin are phosphorylated on serine-10, 

25 while during the remainder of the cell cycle only a small fraction of nucleosomes have 

phosphorylated H3. Antibodies that specifically recognize the phosphorylated epitope of 
histone H3 are available and can be used in immmunocytochemical detection of 
phosphorylated H3 in conjunction with flow cytometric methods [see, e.g., Juan et al (2001) 
Meth Cell Biol 63:343-354]. Bivariate analysis of DNA content versus phosphorylated 

3 0 histone H3 provides for identification of all four phases of the cell cycle. 

In another method of cell phase analysis using flow cytometry, cellular levels of 
cyclin proteins can serve as the basis for determining the phase of a cell and for 
distinguishing cells in different phases. Progression through the cell cycle is maintained by 
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timely, sequential phosphorylation and dephosphorylation of several intracellular proteins. 
Cyclins are proteins involved in the cell cycle control system in cells which works by 
phosphorylation of proteins to active or inactive forms. Cyclins bind to and activate cyclin- 
dependent kinases (cdks). Cyclin-dependent protein kinases influence the progression of the 
5 cell cycle by phosphorylating serine and threonine residues on select proteins that control the 
synthesis of cellular elements required for reproduction. Mammalian cells appear to contain 
separate cdks proteins that are specific for the Gl and G2 checkpoints through which a cell 
passes prior to progressing into the S and M phases, respectively. 

Distinct classes of cyclins are are synthesized during the Gl and G2 phases. In the 

10 normal vertebrate cell cycle control system, at least eight cyclins (cyclins A-H) and eight 

cdks proteins (cdc2 and CDK 2-8) are known. Gl cyclins (e.g., cyclins D and E) accumulate 
and bind cdks proteins during the Gl phase, and a different class of cyclins, mitotic cyclins, 
accumulate and bind cdks proteins during the G2 phase. For example, cell cycle progression 
can begin with induction of cyclin D; passage of the Gl checkpoint requires the binding of 

1 5 CDK2 protein with cyclin E; activation of DNA synthesis requires CDK2 protein binding 
with cyclin A, and entry into M phase requires formation of a complex between cdc2 and 
cyclin B [see, e.g., Merrill (1998) Meth. Cell Biol 57:229-249]. 

The cellular level of cdks proteins is substantially invariable during the cell cycle. In 
contrast, cyclins are synthesized and degraded as the cell progresses through the cell cycle. 

20 Following synthesis, cyclins bind to cdks proteins promoting the phosphorylation of target 
proteins by the cdks. Subsequently, the cyclins are degraded, the cdks are released and target 
protein phosphorylation is reduced or ceased. The rate-limiting step in the binding reactions 
between cdks and cyclins appears to be the synthesis of the appropriate cyclin. Thus, the 
cellular content of some of the cyclins, for example, cylcins D, E, A and B, oscillates 

25 detectably during the cell cycle. Cyclin expression can be analyzed immunocytochemically 
(e.g., using fluorescently labeled anti-cyclin antibodies) by flow cytometry and used in 
distinguishing cells in different phases of the cell cycle based on bivariate analysis of cyclin 
expression versus DNA content. Correlation of DNA content with changes in cyclins permits 
visualization of nine subdivisions of the cell cycle [see, e.g., Darzynkiewicz et al (1996) 

30 Cytometry 25:1-13]. 

Bivariate analysis of DNA content versus 5-bromo-2 c -deoxyuridine (BrdU) 
incorporation is another cytometric assay for assessing cell phase [see, e.g., Dolbeare et al 
(1983) Proc. Natl Acad. Set U.S.A. 50:5573-5577; Moranef al (1985) J! Histochem. 



WO 03/093469 



PCT/CA03/00645 



Cytochem. 35:821-827; Beisker et al (1987) Cytometry 5:235-239; Gray et al (1990) 
"Quantitative cell cycle analysis," in Flow Cytometry and Sorting, Melamed, Lindmo and 
Mendelsohn, eds., pp. 445-467; and Current Protocols in Cytometry (1997), Unit 7.7, John 
Wiley & Sons, Inc.]. BrdU is a thymidine analog that is incorporated into DNA during 
5 replication in cells exposed to the analog. DNA that has incorporated the analog can be 
detected imrnunocytochemically using fluorescein-tagged anti-BrdU antibodies. DNA 
content may be assessed, for example, by counterstaining with a red fluorescing intercalating 
fluorochrome such as, for example, PI or 7-aminoactinomycin D (7-AAD). Bivariate 
analysis of DNA content versus immunofluorescence of anti-BrdU antibody distinguishes S 
10 phase cells on the basis of their difference in DNA content from Gl or G2/M cells and also 
based on incorporation of the green fluorescing anti-BrdU antibodies. 

(ii) Centrifugation/EIutriation-based methods 
Centrifugation and centrifugal elutriation can be used to fractionate cells according to 

their size. Because cells in different phases differ in size, these methods can also be used to 
15 sort cells by cell phase and to thereby assess the phase of a cell. For example, early Gl phase 
cells are about half the size of mitotic or late G2 cells. 

(iii) Chromosome phase 

Chromosomes undergo morphological, ultrastructural and topological changes during 
progression of the cell cycle. Thus, chromosomes from cells in different phases of the cell 

20 cycle can be distinctive. The topology of a chromosome differs at different phases of the cell 
cycle. Interphase chromosomal DNA exists in various decondensed states to facilitate gene 
expression. , Chromatin in chromosomal regions that is not being transcribed exist 
predominently in the condensed form while regions being transcribed assume an extended 
form. Within S phase, chromosomal DNA is further dispersed as it unwinds during the 

25 replication process. Upon conclusion of S phase cohesion occurs to keep extended sister 
chromatids tightly associated. Typically, chromosomes begin to condense during prophase, 
undergoing several orders of supercoiling guided by histones and other facilitator proteins. 
Chromosomes are most dense during metaphase and begin to decondense again during 
telophase as the sister cells divide and normal transcription levels resume. Due to the various 

30 states in which chromosomal DNA exists throughout the cell cycle, when delivering DNA, 
particularly chromosomes, functional fragments thereof or artificial chromosomes into cells 
as provided herein, the phase of the isolated chromosome as well as the recipient cell 
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chromosome can be a consideration in providing for improved transfer into the cells. 
Chromosomal phase phase can be determined by the methods described herein, 
(b) Arresting cell cycle 

(i) Agents and methods of arresting cell cycle 
5 Cell arrest agents may be used to increase the amount of cells in a particular cell cycle 

phase within a population of cells or to arrest a particular cell at a particular phase. The agent 
may then be removed to allow the cell(s) to proceed through the cell cycle. Upon release 
from the agent, a population of cells will proceed more or less simultaneously through about 
one cell cycle. Arrest agents for use in the methods herein include compositions, conditions 

1 0 and physical treatments that arrest the cell cycle in a particular phase. Such agents include, 
but are not limited to, compounds and chemical compositions, peptides, proteins, 
temperature, light, pH, radiation and pressure. 

Physical conditions, for example, may include but are not limited to starvation 
techniques such as nutrient, growth factor or hormone deprivation. Such techniques lead to 

15 an accumulation of cells in a particular stage, usually the Gl or GO stage. Re-addition of 
nutrients or growth factors then allows the cells to re-enter the cell cycle in synchrony for 
about one generation. Most proliferating cell lines are known to respond to nutrient, growth 
factor or hormone deprivation in this manner. Two examples include the L929 mouse 
fibroblast and Nb2 rat lymphoma cell line. Culturing the L929 mouse fibroblast under serum 

20 starvation conditions (Glasgow Modified Minimal Essential Medium with 0.5% newborn calf 
serum and without glutamine) results in arrest of cells GO phase. Upon replacing the 
starvation medium with normal growing medium, all cells progressed through the cell cycle 
in a synchronized manner (see Marenzi et al (1999) Molec. Biol Reports 26:261-267). The 
Nb2 rat lymphoma cell line is absolutely dependent on prolactin for stimulation of 

25 proliferation (see Gout et al (1980) Cancer Res. 40:2433-2436). Culturing the cells in 
prolactin-deficient medium for 18-24 hours leads to arrest of proliferation, with cells 
accumulating early in the Gl phase of the cell cycle. Upon addition of prolactin, all the cells 
progress through the cell cycle until M phase at which point greater than 90% of the cells 
would be in mitosis. Deprivation techniques may be combined with additional arrest agents 

30 including other physical conditions, compounds or compositions, to increase the percentage 
of synchronized cells in the population. One or more agents may be used simultaneously or 
in succession to affect cell arrest. Temperature is another physical cell cycle arrest agent. 
For example, maintaining V79 cells at 25-32oC results in an accumulation of cells in G0/G1 . 
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A further physical arrest agent is incubation under pressurized nitrous oxide which results in 
arrest at M phase (Brinkley and Rao (1973) J. Cell Biol 55:96-106) 

Cell arrest compounds for use in the methods provided herein are available 
commercially or can be synthesized by those of skill in the art. Cell arrest compounds 
5 typically target specific phases of the cell cycle. S phase arrest agents include, among others, 
agents that affect DNA replication or earlier phases through action on DNA polymerase and 
ribonucleotide reductase. Aphidicolin, 5-aminouracil (a thymidine analogue) thymidine, and 
hydroxyurea are examples of S phase arrest agents. Hydroxylurea disrupts S phase, for 
example, by destroying an essential tyrosine-free radical on the ribonucleotide reductase 
10 enzyme. Thymidine causes overaccumulation of thymidine triphosphate, which allosterically 
inhibits the reduction of cytidine diphosphate by ribonucleotide reductase, thus starving the 
replication forks for deoxycytidine triphosphate. Aphidicolin blocks replication by directly 
biding and inhibiting DNA polymerase. M phase cell arrest can be mediated by agents that 
affect specific mitotic events such as microtubule polymerization and spindle arrest. M phase 
15 arrest agents include, for example, nocodazole, colchicine, demecolcine (colcemide), 

oryzalin, propyzamide, trifluralin, 8-hydroxyquinoline, paradichlorobenzene, vinblastine, and 
a number of other glucosides, alkaloids, and coumarins (see Sharma, AK (1956) Bot Rev 
22:665-695; Sharma AK and Sharma A (1999) Plant Chromosomes: Analysis, Manipulation, 
and Engineering; Sharma AK (1999) Methods Cell Sci 21 :73-78). Other arrest compounds 
20 for use in the methods provided herein include but are not limited to 1-p-D- 

Arabinofuranosylcytosine, olomoucine, roscovitine, 2-[N-(2-hydroxyethyl)]-N-(4- 
methoxybenzenesuffonyl)]amino-N-(4-^ daidzein, 
camptothecin, 5,6-DicWoro-l-I>-D-ribofuanozylbenzimidazole, 5-fluorouracil, 
dexamethasone, acycloguanosine, calpain inhibitor I, cytosine arabinoside, CeReS-18, 
25 trichostatin A, indole-3 -methanol, Staurosporine, tryprostatin B, and verruculogen. 

Any cell arrest agent or combination of arrest agents can be used for enhancing a 
particular method of delivering nucleic acid molecules, such as DNA, into a particular cell 
type using the provided methods. The amount of agent and duration of exposure of a cell to 
the agent is dependent on the agent and cell type. These parameters can be readily 
30 determined for any agent and any cell by the methods provided herein. 

In a particular embodiment of the method, one arrest agent is applied in an optimal 
amount and for an optimal duration to affect cell arrest. In another embodiment, one arrest 
agent is applied and removed in successive rounds or applied in different concentrations at 
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different time points throughout the cell cycle to effect cell arrest. For example, using a 
"double block" protocol, cells may be incubated in replication-inhibiting levels of an arrest 
agent, such as thymidine, for sufficient time to let G2, M, and Gl cells collect in early S, then 
cells are incubated in low levels of agent for sufficient time to let all S phase cells complete 
replication, and finally cells are incubated again in high levels of agent for sufficient time to 
allow all the newly generated G2, M, and Gl cells to collect in early S phase. In another 
embodiment two or more arrest agents are applied simultaneously to effect cell arrest. In a 
further embodiment, two or more agents are applied successively to effect cell arrest. For 
example, to boost the yield of cells arrested in M phase, cells can be first exposed to an S 
phase arrest agent such as aphidicolin or thymidine. The S phase block can then be removed 
and the cells further incubated under mitosis-arresting conditions. After arresting with the S 
phase agent, a greater number of the cells in the population will be in a relatively late cell 
cycle phase, thus a greater number of cells will collect in M phase during the latter 
incubation. 

One of skill in the art by using the provided optimization methods can readily 
determine which of the cell arrest agents or combination of agents are best suited for 
facilitating cell arrest and enhancing delivery of specific nucleic acid molecules, such as 
DNA, into a specific target cell type. These methods include determining the mean 
generation time, lengths of each phase of the cell cycle, fraction of cells proliferating in a cell 
population, arrest agent concentration, arrest agent incubation time, growth conditions, and 
nutrient conditions 

Cell arrest protocols may include contacting cells with arrest agents for periods of 
time related to the duration of specific cell cycle phases, for example, the double block 
method described above. Thus, optimization may require knowledge of the duration of each 
cell cycle phase. To determine the duration of each cell cycle phase, determination of the 
mean generation time (length of one cell cycle) may be required. The mean generation time 
of cells in a population can be calculated from the change in cell number with incubation 
time. For example, cell number as a function of incubation time may be plotted for several 
different time points on semilog paper. A best fit line can be drawn through the data points 
and the time take for cell number to double can be calculated, yielding the mean generation 
time. 

If the mean generation time is known, the duration of Gl, S, G2, and M phase may be 
estimated. The duration of each cell cycle phase may be estimated, for example, by 
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microscopy or flow cytometry. Microscopic methods may include autoradiography of 
labeled mitotic figures after pulse/chase labeling with [3H]thymidine (see e.g., Quastler and 
Sherman (1959) Exp. Cell Res. 77:420-438). When using flow cytometry, asynchronously 
growing cells can be stained with a DNA-intercalating dye such as propidium iodide, and 
5 fluorescence intensity can be used to determine the frequency of cells with a 2n, intermediate, 
or 4N complement of DNA. From a histogram of the DNA values, the phases can be 
determined manually or by computer algorithm (see e.g., Merrill, G. (1998) Methods Cell 
Biology; Cell Synchronization 57:229-249). 

In addition, knowing the fraction of cells that are proliferating in the cell population 

10 may be useful for optimizing cell arrest conditions. Many methods of analyzing the average 
duration of a cell division cycle and its component phases assume that most cells (>95%) in 
the population are actively proliferating and that cell death is low (<5%). If a significant 
fraction of a population is quiescent, then the amount of quiescent cells may need to be 
considered when determining cell cycle kinetics of a cell population. To determine the 

1 5 fraction of quiescent cells in a population, the number of cells that fail to replicate their DNA 
during an interval corresponding to twice the mean generation time can be calculated. To 
determine the replicative fraction, [3H]thymidine incorporation followed by autoradiography 
or bromodeoxy-uridine(BUdr)incorporation followed by immunostaining may be used. 

The optimum arrest agent concentration is the concentration that results in the greatest 

20 percentage of synchronized cells while maintaining acceptable levels of viability. Optimum 
arrest agent concentration may be determined by monitoring the dose response of a specific 
cell to a specific arrest agent. Dose response can be determined by titrating cells with an 
arrest agent and detecting cellular response. Cellular response can be detected, for example, 
by monitoring cell cycle phase and cell viability for each dose point over a period of one 

25 generation. Cell cycle phases can be monitored using cytometric analysis. In a specific 

example, for each dose point, cells are exposed to an agent or stain such as propidium iodide 
and the proportion of cells in G0-G1, S, G2, and M are determined by flow cytometry. PI 
binds to the DNA within a cell and the amount of DNA contained in a cell is determined by 
comparing experimental PI measurements with control PI measurements. Any dye that 

30 incorporates into DNA can be used in a similar manner to PL A cell that has not undergone 
DNA synthesis (S phase of the cell cycle) has 1 copy of the genome and is therefore in 
G0/G1 . Once S phase is initiated the amount of DNA is greater than one copy and at the end 
of S phase the amount of DNA is at 2 copies and the cell is then in the G2/M stage of the cell 
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cycle. Following nuclear and cellular division (M phase), each cell again has a single copy of 
the genome. 

Arrest agent incubation time is a key factor for optimizing cell arrest. Excessively 
long incubations lead to poor synchrony or low viability. Insufficiently long incubations lead 
5 to poor synchrony because not all cells in the population reach the arrest state. Optimum 
arrest agent incubation time can be determined by exposing the cell to the agent and 
monitoring cell cycle phase over a period of one generation as described above. The 
optimum arrest agent incubation time is the time point at which the majority of the cells are 
arrested in the same phase while maintaining acceptable levels of viability. 
1 0 Growth and nutrient conditions are dependent on cell type. Methods of determining 

growth and nutrient conditions are well known to those of skill in the art. 

Meiosis regulating agents may also be used to obtain cells in a preselected phase. 
Meiosis regulating compounds are for example disclosed in U.S. Patent No. 6,486,145; 
6,407,086; and 5,716,777 (all of which are incorporated herein by reference). 
15 (ii) Assessing cell cycle arrest 

Extent of cell cycle arrest can be determined using any methods of assessing cell 
phase as described herein or known in the art. 

(c) Releasing cells from cell cycle arrest 
In the methods provided herein for introducing nucleic acid molecules into cells, the 
20 nucleic acid molecules can be introduced into cells whose growth has been arrested, or into 
cells at a defined time point after release from cell cycle arrest. Release of cells from cell 
cycle arrest results in a synchronous population of cells, thus the defined time point 
corresponds to a particular phase of the cell cycle in the synchronous population of cells. 
Methods for releasing cells from cell cycle arest involve the restoration of normal growth 
25 conditions. Such methods are dependent on the cell arrest agent and type of cell used. For 
example, cells arrested by physical conditions such as nutrient, growth factor or hormone 
deprivation, are released from arrest by re-addition of the nutrient, growth factor, or hormone 
to the growth media. Cells arrested with a chemical agent may be released by removing the 
agent by washing and/or replacing the media as described, for example, in Example 10, or by 
30 the addition of an agent that counteracts or inhibits the arrest agent, for example, adding 2 
deoxycytidine 5 'mono-phosophate (dCMP) to release cells from thymidine arrest. Upon 
restoration of normal growth conditions the cells re-enter the cell cycle in synchrony for 
about one generation depending on the cell type. The cells then gradually return to their 
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logarithmically growing state. Re-entry into the cell cycle may be monitored by the methods 
described herein for monitoring cell cycle phase. 

(d) Assessing delivery efficiency 
Rapid Assessment Methods 

5 Rapid delivery assessment methods based on automated, sensitive and accurate 

analysis procedures, such as flow cytometry, can be used to evaluate delivery efficiency. 
Such methods are less time-consuming and more accurate then mainual detection of 
individual transfected cells by microscopic techniques. Using such rapid methods, it is 
possible to analyze nucleic acid molecule delivery data within 48 hours after transfection. 

10 Microscopic and colony formation analysis methods that may be used in evaluating stable 
nucleic acid molecule delivery rely on manual visualization or measurement of nucleic acid 
molecules (e.g., a selectable marker gene) expression, which is a distinct process from 
delivery. Such methods are associated with time delays in obtaining an assessment of the 
delivery method. 

15 Data collected by flow cytometry analysis are statistically superior due to the ease at 

which large numbers of events, e.g., nucleic acid molecule transfer, are collected. The 
positive values obtained in these methods are instrument derived and therefore not as 
susceptible to judgment errors. Thus, these methods provide for greater accuracy in assessing 
nucleic acid molecule delivery. In contrast, microscopic analysis is limited by the time 

20 involved for scoring positive events and sample size is restrictive. 

Because the methods of monitoring nucleic acid molecule delivery detect labeled 
nucleic acid molecules, such as DNA, and not a reporter gene expression product, it is 
possible to measure absolute values of nucleic acid molecules transferred, within twenty-four 
hours, without being hindered by cell autofluorescence and by the problems of differentiating 

25 wild-type cells from cells expressing low levels of reporter gene products (see, e.g., Ropp et 
al (1995) Cytometry 27:309-317). 

In rapid methods for monitoring nucleic acid molecule delivery, the nucleic acid 
molecules, such as DNA, to be delivered are labeled to allow for detection of the nucleic acid 
molecules in recipient cells after transfer into the cells. The nucleic acid molecules may be 

30 labeled by incorporation of nucleotide analogs. Any nucleic acid molecule analog that may 
be detected in a cell may be used in these methods. The analog is either directly detectable, 
such as by radioactivity, or may be detected upon binding of a detectable molecule to the 
analog that specifically recognizes the analog and distinguishes it from nucleotides that make 
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up the endogenous nucleic acid molecules, such as DNA, within a recipient cell. Analogs 
that are directly detectable have intrinsic properties that allow them to be detected using 
standard analytical methods. Analogs may also be detectable upon binding to a detectable 
molecule, such as a labeled antibody that binds specifically to the analogs. The label on the 
5 antibody is one that may be detected using standard analytical methods. For example, the 
antibody may be fluorescent and be detectable by flow cytometry or microscopy. 

In particular embodiments of these methods, the nucleic acid molecules, such as 
DNA, to be transferred is labeled with thymidine analogs, such as iododeoxyuridine (IdUrd) 
or Bromodeoxyuridine (BrdU). In preferred embodiments, IdUrd is used to label the nucleic 
10 acid molecules, such as DNA, to be transferred. The transferred IdUrd-labeled nucleic acid 
molecules, such as DNA, may be immunologically tagged using an FITC-conjugated anti- 
BrdU/IdUrd antibody and quantified by flow cytometry. Thus, the transfer of the labeled 
nucleic acid molecules, such as DNA, into recipient cells can be detected within hours after 
transfection. 

15 Microscopv-based Assessment Methods 

Microscopic techniques for visualizing chromosome or plasmid transfer using 
bromodeoxyuridine (BrdU) (see. e.g., Pittman et al J Immunol Methods 703:87-92 (1987)) 
are known in the art. These methods typically involve use of a large sample size to detect 
any low levels of transfer and further involve manual scoring. 

20 Factors to Consider in Addressing Delivery of Nuclei c Acids 

Delivery of nucleic acids, including DNA, into cells is a process in which nucleic 
acids are transferred to the interior of a cell. Methods for the delivery of nucleic acids may 
be assessed in a variety of ways, including the following. 

(i) Transfer or delivery efficiency 

25 A delivery method may be assessed by determining the percentage of recipient cells 

in which the nucleic acids, including DNA, is present (i.e., the transfer efficiency). However, 
when evaluating a delivery method for the ultimate goal of generating cells that express the 
transferred nucleic acid, there are additional factors beyond mere presence of the nucleic acid 
in recipient cells that should be considered. Included among these additional factors is 

30 integrity of the nucleic acids and cell viability. When assessing a proliferating cell 

population, clonogenicity is the method of choice to measure viability. When the target cells 
population is non-dividing or slow growing, metabolic integrity can be monitored. 
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(ii) Clonogenicity 

Clonogenicity represents a measure of the survivability of cells with respect to a 
delivery procedure, growth conditions and cell manipulations (e.g., plating). It is important 
to assess clonogenicity to determine whether a delivery procedure results in a sufficient 
number of viable cells to achieve a desired number of cells containing the transferred nucleic 
acid. 

Clonogenicity may be expressed as a clonal fraction. The clonal fraction is an index 
that is calculated by multiplying two separate fractions and normalizing to a control plating 
efficiency correction factor (CPE). The two separate fractions that are multiplied in this 
calculation are the fraction of cells that survive a delivery procedure (population cell yield) 
anii the fraction of cells that survive a plating procedure. The calculation is thus as follows: 

Colonal Fraction = # viable colonies after plating x # cells nost-transfection 

# cells plated # cells transfected x CPE 

The values used in this calculation for the number of cells post- transfection (i.e., 
post-delivery) and the number of colonies post-plating is based on cell or colony numbers at 
certain times in the process. For instance, the value for the number of cells post-transfection 
is representative of the number of cells at a time after nucleic acid delivery that is sufficient 
for the delivery process to be completed. This time may be determined empirically. 
Typically this time ranges from 4-48 hours and generally is about one day after transfection. 
Likewise, the value of the number of viable colonies post-plating is representative of the 
number of colonies at a time after nucleic acid delivery that is sufficient for the non-viable ' 
cells to be eliminated and the viable cells to be established as colonies. This time may be 
determined empirically. Typically this time ranges from that in which the average colony is 
made up of approximately 50 cells or generally is a time at which five cell cycles have 
passed. 

A correction factor is included to take into account the plating efficiency of control 
wells, which is the ratio determined by the number of colonies counted divided by the 
number cells initially plated (typically 600-1000 cells). For LM(tk-) and V79-4 cells, the 
value of the correction factor typically ranges from about 0.7 to about 1 .2 and may be, for 
example, 0.9. 

The number of cells plated should remain constant at 1000 (simplified plating 

efficiency assay) done in duplicate, except in the case where the CPE is below 0.3, then 
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number of cells seeded should be increased to a range of 5,000-50,000. If the CPE is below 
0.1-0.2, then a viable fraction analysis should be considered. 

(iii) Viable fraction 

If the target cell population is non-dividing or slowly dividing then reproductive or 
clonogenicity assays are not relevant. Less direct measurements of cell viability must be 
used to measure cell killing that monitor metabolic death rather than loss of reproductive 
capacity. These procedures include, for example: (1) membrane integrity as measured by dye 
exclusion, (2) inhibition of nucleic acid synthesis as measured by incorporation of nucleic 
acid precursors, (3) radioactive chromium release, and (4) MTT ASSAY (3-[4,5- 
dimethylthiazol-2-yl]-2,5-diphenyl-tetrazolium bromide). These methods are different from 
measurements of loss of proliferative capacity, as they reflect only immediate changes in 
metabolism, which can be reversed or delayed and hence lead to errors in estimation of cell 
viability. To minimize these errors, correlation of duplicate procedures is suggested. 

(iv) Potential transfection efficiency (PTE) and determination of 
Chromos Index (CI) 

In assessing a delivery method used to transfer nucleic acids to cells with the goal of 
expression of the nucleic acids, including DNA, therein, it is desirable to obtain an indication 
of the theoretical maximum percentage of cells that are viable and contain the nucleic acid 
out of the total number of cells into which nucleic acids were delivered. This is referred to as 
the potential transfection efficiency and may be calculated from existing or historical, 
experimental data sets and is determined as follows: 

Potential Transfection Efficiency (PTE) = Transfer Efficiency x (Clonal Fraction or 

Viable Fraction) x correction factor (CF) 

The Chromos Index (C.I.) is an effective and rapid method to determine the Potential 
Transfection Efficiency of a proliferating population by using experimental values of % 
labeled nucleic acid, such as ACes, delivery to measure transfer efficiency and clonal fraction 
measured using a simplified clonogenicity assay. 

Chromos Index (CI) = % labeled ACes delivery x estimated Clonal fraction x CF 

The values of the transfer efficiency and of the clonal fraction and viable fraction are 

calculated as described above. The correction factor (CF) takes into account sample size, 

34 



WO 03/093469 



PCT/CA03/00645 



sample time and control plating efficiency. If all these factors are constant for each variable 
i.e., sampling time and size then the correction factor will approach the inverse of the value 
for the C.P.E., i.e., such that the clonal fraction or transfer efficiency can still approach 100% 
even with a low CF, or in other words, if delivery and viability are 100%, then the maximum 
potential transfection efficiency will equal the plating efficiency of the control cells. The 
calculation of C.I. allows for determination of each variable optimization, with the goal being 
for parameters, such as transfer efficiency, clonal fraction, and CF to approach one (or 
100%). If sample size or time varies for either clonal fraction or transfer efficiency, then CF 
represents the extrapolated value based on slope or rate of change. An application of this 
assessment is provided in the EXAMPLES. 

A stable transfection efficiency of about 1% is in the range (1-100%) that is 
considered useful for the introduction of large nucleic acid molecules into target cells. It is 
possible, using methods provided herein, to predict which delivery methods have to be 
selected for achieving desired transfection efficiencies without having to grow transfectants 
for extended times under selective conditions and determine numbers of cells surviving 
selection marker expression. This analysis involves calculation of the Chromos Index (CI) 
which integrates a "biological" value (the clonal fraction) with a measurement of 
chromosomal "uptake" or transfer efficiency (percentage of cells containing delivered ACes). 
(e) Assessing integrity of transferred nucleic acid 
Assessment of delivery of nucleic acid molecules may also be combined with an 
assessment of nucleic acid molecule, such as DNA, expression and integrity in recipient cells 
to provide even further information concerning the overall process of nucleic acid molecule 
transfer for purposes of expression. 

It is of interest to evaluate the stability of the nucleic acid molecule, such as DNA, 
under the selected delivery conditions. Some delivery conditions and agents may have 
adverse effects on nucleic acid molecule structure. Furthermore, the labeling techniques used 
in certain methods of monitoring nucleic acid molecules, such as DNA, delivery may also 
impact nucleic acid molecules, such as DNA, structure and function. 

The effects of delivery conditions on nucleic acid molecules may be assessed in a 
variety of ways, including microscopic analysis. In a particular exemplary analysis of the 
stability of artificial chromosomes, e.g., ACes, the chromosomes are exposed to the 
conditions of interest, e.g., IdU labeling, and analyzed under a fluorescent microscope for the 
ability to remain intact and condensed after incorporation of nucleotide analogs. 
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Analysis of the transfected cells may be performed to identify and assess the integrity 
of transferred nucleic acids. For example, the cells may be analyzed for indications of 
transferred nucleic acids such as artificial chromosomes or chromosomal segments, the 
presence of structures that may arise in connection with such nucleic acids and/or the 
5 presence of such nucleic acids. The size and structure of a transferred nucleic acid such as an 
artificial chromosome or ACes may also be determined. Analysis of the cells typically 
involves methods of visualizing nucleic acid structure and particularly chromosome structure, 
including, but not limited to, G- and C-banding and fluorescent in-situ hybridization (FISH) 
analyses using techniques described herein and/or known to those of skill in the art. Such 

10 analyses can employ specific labelling of particular nucleic acids, such as satellite DNA 
sequences, heterochromatin, rDNA sequences and heterologous nucleic acid sequences, 
(f) Assessing expression of transferred nucleic acid 
To facilitate analysis of nucleic acid molecules, such as DNA, expression, it is 
desirable to include in the transferred nucleic acid molecules, such as DNA, a reporter gene 

15 that encodes a readily detected product. For direct detection, such reporter gene products 
include, but are not limited to green fluorescent proteins (GFP), Red Fluorescent protein 
(RFP), luciferases, and CAT. For indirect detection, reporter gene products include, but are 
not limited, to p-galactosidase and cell surface markers. 

By using, for example, artificial chromosomes such as ACes containing a GFP 

20 reporter gene, such as, but are not limited to, GFP coding sequences in combination with 

labeling of the ACes with DNA analogs, such as IdU, delivery and expression can be rapidly 
and accurately monitored. For example, following the delivery of IdU-labeled GFP gene- 
containing ACes to target cells by any of the described methods, the cells containing the 
ACes are split into two populations. One population is fixed and stained for IdU and analyzed 

25 ' by flow cytometry to determine percentage delivery. The other population is allowed to go 
through 4-5 cell divisions (approximately 72 hours), and the GFP fluorescence is measured as 
an indication of expression. 

Such studies have revealed that incorporation of the analog label does not affect GFP 
protein expression, which indicates that the methods may be combined to monitor delivery 

30 and early expression of the ACes, thus providing more information to rapidly evaluate the 
efficiency of delivery methods. The combined methods can also be used to map the 
biological events between the initial stages of delivery and early gene expression. 
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(g) Cell cycle synchronization 

The methods described herein provide for the transfection of a single cell or a 
population of two or more cells. To transfect a population of cells, it is often desirable to use 
a synchronous population of cells. A synchronous population of cells refers to a population 
5 in which a greater number of cells are in the same single phase compared to a corresponding 
asynchronous population. An asynchronous population of cells can be a population of cells 
in conditions that allow for normal exponential growth that have not been exposed to 
conditions that effect synchronization. Any method for obtaining a synchronous population 
of cells may be used in the methods described herein. 
10 Methods for obtaining a synchronous population of cells may include isolating cells at 

a specific cell cycle stage using mechanical means such as mitotoic detachment or elutriation 
or chemical means such as exposing the cell to a cell cycle arrest agent. Mitotic detachment 
(or mitotic shake-off) is a minimally disruptive way of synchronizing monolayer cells. When 
most monolayer cells enter mitosis, they round up and become only slightly attached to the 
1 5 substratum of the culture plate. Because of their size and loose attachment, mitotic cells can 
be isolated by shaking and transferring the medium containing the detached cells to a fresh 
culture vessel. To generate large numbers of cells, a greter number of mother culture plates 
or vessels may be used. 

Elutriation methods include sorting cells based upon morphological or biochemical 
20 features such as cell size, chromosome morphology, DNA content, base content, protein 

content (including cyclins), or other indicators of cell cycle phase such as enzymes or other 
factors that are expressed, degraded, activated or inactivated at defined phases of the cell 
cycle. Cells may also be sorted using combinations of the above cell cycle phase indicators. 
Combinations of cell cycle phase indicators can be used simultaneously or in consecutive 
25 sorting procedures. 

Centrifugal elutriation is one method of sorting cells based on size and density. The 
system consists of a specialized centrifuge rotor in which the centrifugal force and opposing 
bulk medium flow create a gradient, with smaller cells at the top and larger cells at the 
bottom. The rotor speed or medium flow is manipulated such that the gradient of size- 
30 separated cells is pushed toward the top and the small cells at the top of the gradient are 

eventually pushed out of the elutreation chamber and into a collection vessel. With further 
manipulation of the rotor speed and medium flow, progressively larger cells are pushed out of 
* the elutriation chamber. Since Gl cells are roughly half the size of mitotic or late G2 cells, 
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centrifugal elutreation can be used to fractionate cells according to their position in the cell 
cycle. Unlike the mitotic shake-off procedure, centrifugal elutriation can be used to 
synchronize both monolayer and suspension culture cells. 

Cell phase sorting can also be accomplished through the use of a flow cytometer and 
sorter such as the fluorescence-activated cell sorter (FACS) as described herein. After 
elutriation the number and volume of cells in different fractions can be assessed using a 
multichannel cell analyzer. An advantage of using physical means to create a synchronized 
population of cells is that the cells do not spend any time in a growth-arrested state and 
therefore, anabolic activity remains relatively constant. Such methods also avoid the possible 
side-effects from treament of cells with chemicals. 

Alternatively, cell synchronization agents may be used alone or in combination with 
physical sorting means to enhance a synchronous population of cells. Cell cycle arrest 
agents, as described herein, act by blocking the progression at a defined stage of the cell cycle 
which results in the accumulation of cells within the defined stage. Upon release from the 
arrest agent, the cells progress through the cell cycle in synchrony. When used in 
combination with cell sorting techniques, the cells may be sorted prior to or after release from 
the arrest agent. 

The percentage of cell cycle synchronization within a population can be determined at 
different time points by the methods described above. The percentage of cell cycle 
synchronization is a measure of the percentage of cells cycling together in any one given 
phase at any one time point. The maximum percentage of cell cycle synchronization 
achievable is 1 00% of the cell population cycling together in any one phase. The percentage 
of cell cycle synchronization achievable using the cell cycle sorting and/or arrest and release 
methods described herein is at least 50%, at least 60%, at least 70% at least 75%, at least 
80%, at least 85%, at least 90%, at least 95% of cells cycling together in a particular phase. 
D. METHODS FOR THE DELIVERY OF DNA INTO CELLS 

In methods provided herein for delivering nucleic acids to cells, a recipient cell in a 
pre-selected phase is contacted with a nucleic acid molecule, such as a large nucleic acid 
molecule. The contacting can be conducted in a variety of ways under a variety of 
conditions. Conditions and procedures can be determined, for example, by comparing the 
delivery and/or txansfection efficiencies (as can be calculated using methods described 
herein) achieved under various conditions. 
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The conditions used for delivering particular nucleic acid molecules, such as DNA, to 
recipient cells can depend on the particular nucleic acid molecule being transferred and the 
particular recipient cell. Particular conditions are those that result in the greatest amount of 
nucleic acid molecules, such as DNA, transferred into the cell nucleus with an acceptable 
degree of cell survival. Suitable conditions for delivery of particular pairings of nucleic acid 
molecules, such as DNA, and recipient cells can be determined using methods of monitoring 
nucleic acid molecules, such as DNA, delivery and methods of screening agents and 
conditions as provided herein or can be determined empirically using methods known to 
those of skill in the art. 

In determining which conditions to use for contacting a nucleic acid with a cell in a 
pre-selected phase, a number of parameters can be considered. A method for detection of 
delivered nucleic acid is provided. This method, which can be used for assessing delivery of 
any nucleic acid molecule, can be used as a rapid screening tool to optimize nucleic acid, e.g., 
chromosome, transfer conditions. 

In particular, delivery conditions can first be assessed for the ability to transfer nucleic 
acid molecules, such as DNA, into cells and to identify methods that provide a sufficient 
number of viable cells that express the transferred nucleic acid molecules, such as DNA. 
Once such conditions are identified, they can be optimized using delivery monitoring 
methods provided herein or known in the art and then assessed for the ability to provide for 
expression of the transferred nucleic acid molecules. 

1. Naked DNA 

"Naked" nucleic acid can be contacted with the recipient cell in the absence of any 
delivery agent or other procedures designed to facilitate transfer of nucleic acids into cells. 

2. Microinjection 

Nucleic acids can be delivered into cells through microinjection. Procedures for 
microinjection are known in the art. 

3. Delivery agents 

Delivery agents include compositions, conditions and physical treatments that 
enhance contact of nucleic acid molecules, such as DNA, with cells and/or increase the 
permeability of cells to nucleic acid molecules, such as DNA. Such agents include, but are 
not limited to, cationic compounds, peptides, proteins, energy, for example ultrasound energy 
and electric fields, and cavitation compounds. 
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Delivery agents for use in the methods provided herein include compositions, 
conditions or physical treatments to which cells and/or nucleic acid molecules, such as DNA, 
can be exposed in the process of transferring nucleic acid molecules, such as DNA, to cells in 
order to facilitate nucleic acid molecules, such as DNA, delivery into cells. For example, 
compounds and chemical compositions, including, but not limited to, calcium phosphate, 
DMSO, glycerol, chloroquine, sodium butyrate, polybrene and DEAE-dextran, peptides, 
proteins, temperature, light, pH, radiation and pressure are all possible delivery agents, 
(a) Cationic compounds 

Cationic compounds for use in the methods provided herein are available 
commercially or can be synthesized by those of skill in the art. Any cationic compound can 
used for delivery of nucleic acid molecules, such as DNA, into a particular cell type using the 
provided methods. One of skill in the art by using the provided screening procedures can 
readily determine which of the cationic compounds are best suited for delivery of specific 
nucleic acid molecules, such as DNA, into a specific target cell type, 
(i) Cationic lipids 

Cationic lipid reagents can be classified into two general categories based on the 
number of positive charges in the lipid headgroup; either a single positive charge or multiple 
positive charges, usually up to 5. Cationic lipids are often mixed with neutral lipids prior to 
use as delivery agents. Neutral lipids include, but are not limited to, lecithins; phospha- 
tidylethanolamine; phosphatidylemanolamines, such as DOPE 

(dioleoylphosphatidylethanolamine), DPPE (dipalmitoylphosphatidyl-ethanolamine), POPE 
(palmitoyloleoylphosphatidylethanolamine) and distearoylphosphatidylethanolamine; 
phosphatidylcholine; phosphatidylcholines, such as DOPC (dioleoylphosphatidylcholine), 
DPPC (dipalmitoylphosphatidylcholine), POPC (palmitoyloleoylphosphatidylcholine) and 
distearoyl-phosphatidylcholine; fatty acid esters; glycerol esters; sphingolipids; cardiolipin; 
cerebrosides; and ceramides; and mixtures thereof. Neutral lipids also include cholesterol 
and other 3pOH-sterols. 

Other lipids contemplated herein, include: phosphatidylglycerol; 
phosphatidylglycerols, such as DOPG (dioleoylphosphatidylglycerol), DPPG 
(dipalmitoylphosphatidylglycerol), and distearoyl-phosphatidylglycerol; phosphatidylserine; 
phosphatidylserines, such as dioleoyl- or dipalmitoylphosphatidylserine and 
diphosphatidylglycerols. 
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Examples of cationic lipid compounds include, but are not limited to: Lipofectin (Life 
Technologies, Inc., Burlington, Ont.)(l:l (w/w) formulation of the cationic lipid N-[l-(2,3- 
dioleyloxy)propyl]-N,N,N-trimethylammonium chloride (DOTMA) and 
dioleoylphosphatidylethanol-amine (DOPE)); LipofectAMINE (Life Technologies, 
5 Burlington, Ont, see U.S. Patent No. 5,334,761) (3 : 1 (w/w) formulation of polycationic lipid 
2,3-dioleyloxy-N-[2(spermine-carboxamido)emyl]-N,N-dimethyl-l- 

propanaminiumtrifluoroacetate (DOSPA) and dioleoylphosphatidyl-ethanolamine (DOPE)), 
LipofectAMINE PLUS (Life Technologies, Burlington, Ont. see U.S. Patent Nos. 5,334,761 
and 5,736,392; see, also U.S. Patent No. 6,051,429) (LipofectAmine and Plus reagent), 

10 LipofectAMINE 2000 (Life Technologies, Burlington, Ont.; see also International PCT 
application No. WO 00/27795) (Cationic lipid), Effectene (Qiagen, Inc., Mississauga, 
Ontario) (Non liposomal lipid formulation), Metafectene (Biontex, Munich, Germany) 
(Polycationic lipid), Eu-fectins (Promega Biosciences, Inc., San Luis Obispo, CA) (ethanolic 
cationic lipids numbers 1 through 12: C52H106N6O4.4CF3CO2H, C 8 8H 1 78N80 4 S2.4CF3C0 2 H, 

1 5 C 4 oH84N0 3 P.CF3C0 2 H, C 5 oHio3N 7 03.4CF 3 C02H, C 55 Hi i 6 N 8 0 2 .6CF 3 C02H, 

C49Hi02N 6 O3.4CF3CO2H,C44H 8 9N5O3.2CF3CO2H,C I0 0H 2 06Ni2O4S2.8CF3CO2H, 
Ci 62 H33oN2209.13CF 3 C02H, C 43 H8 8 N402.2CF 3 C0 2 H, C4 3 H 8 8N403.2CF 3 C0 2 H, C4iH 78 N0 8 P); 
Cytofectene (Bio-Rad, Hercules, CA ) (mixture of a cationic lipid and a neutral lipid), 
GenePORTER (Gene Therapy Systems Inc., San Diego, CA) (formulation of a neutral lipid 

20 (Dope) and a cationic lipid) and FuGENE 6 (Roche Molecular Biochemicals, Indianapolis, 
IN) (Multi-component lipid based non-liposomal reagent). 

(ii) Non-lipid cationic compounds 
Non-lipid cationic reagents include, but are not limited to SUPERFECTp (Qiagen, 
Inc., Mississauga, ON) (Activated dendrimer (cationic polymerxharged arnino groups) and 

25 CLONfectinp (Cationic amphiphile N-t-butyl-N e -tetradecyl-3-tetradecyl- 
aminopropionamidine) (Clontech, Palo Alto, CA). 

Pyridinium amphiphiles are double-chained pyridinium compounds, which are 
essentially nontoxic toward cells and exhibit little cellular preference for the ability to 
transfect cells. Examples of a pyridinium amphiphiles are the pyridinium chloride surfactants 

30 such as SAINT-2 (l-methyl-4-(l-octadec-9-enyl-nonadec-10-enylenyl) pyridinium chloride) 
(see, e.g., van der Woude et al. (1997) Proc. Natl. Acad. Sci. U.S.A. 94:\ 160). The 
pyridinium chloride surfactants are typically mixed with neutral helper lipid compounds, 
such as dioleoylphosphatidylethanolamine (DOPE), in a 1 :1 molar ratio. Other Saint 
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derivatives of different chain lengths, state of saturation and head groups can be made by 
those of skill in the art and are within the scope of the present methods, 
(b) Energy 

Delivery agents also include treatment or exposure of the cell and/or nucleic acid 
molecules, but generally the cells, to sources of energy, such as sound and electrical energy, 
(i) Ultrasound 

For in vitro and in vivo transfection, the ultrasound source should be capable of 
providing frequency and energy outputs suitable for promoting transfection. For example, 
the output device can generate ultrasound energy in the frequency range of 20 kHz to about 1 
MHz. The power of the ultrasound energy can be, for example, in the range from about 0.05 
w/cm2 to 2 w/cm2, or from about 0. 1 w/cm2 to about 1 w/cm2. The ultrasound can be 
administered in one continuous pulse or can be administered as two or more intermittent 
pulses, which can be the same or can vary in time and intensity. 

Ultrasound energy can be applied to the body locally or ultrasound-based 
extracorporeal shock wave lithotripsy can be used for "in-depth" application. The ultrasound 
energy can be applied to the body of a subject using various ultrasound devices. In general, 
ultrasound can be administered by direct contact using standard or specially made ultrasound 
imaging probes or ultrasound needles with or without the use of other medical devices, such 
as scopes, catheters and surgical tools, or through ultrasound baths with the tissue or organ 
partially or completely surrounded by a fluid medium. The source of ultrasound can be 
external to the subject 's body, such as an ultrasound probe applied to the subject 's skin 
which projects the ultrasound into the subject 's body, or internal, such as a catheter having 
an ultrasound transducer which is placed inside the subject «s body. Suitable ultrasound 
systems are known (see, e.g., International PCT application No. WO 99/21584 and U.S. 
Patent No. 5,676,151). 

When the cationic compound and nucleic acid molecules, such as DNA, are 
administered systemically, the ultrasound can be applied to one or several organs or tissues 
simultaneously to promote nucleic acid molecule delivery to multiple areas of the subject 's 
body. Alternatively, the ultrasound can be applied selectively to specific areas or tissues to 
promote selective uptake of the nucleic acid molecules, such as DNA. 

The transfection efficiency of the ultrasound can also be enhanced by using contrast 
reagents, which serve as artificial cavitation nuclei, such as Albunex (Molecular Biosystems, 
San Diego, CA), Imagent (Alliance Pharmaceutical, San Diego, CA), Levovist-SHU 
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(Schering AG, Berlin, Germany), Definity (E.L du Pont de Nemour, Wilmington, DL), STUC 
(Washington University, St Louis, MO) and the introduction of gaseous microbubbles. A 
contrast reagent can be introduced locally, such as a joint; introduced systematically, with the 
enhancement of cavitation efficiency by focusing lithotripter shock waves at a defined area; 
or by targeting a contrast reagent to a particular site and then enhancing cavitation efficiency 
by focusing lithotripter shock waves. 

(ii) Electroporation 

Electroporation temporarily opens up pores in a cell 's outer membrane by use of 
pulsed rotating electric fields. Methods and apparatus used for electroporation in vitro and in 
vivo are well known (see, e.g., U.S. Patent Nos. 6,027,488, 5,993,434, 5,944,710, 5,507,724, 
5,501,662, 5,389,069, 5,318,515). Standard protocols may be employed. 
E. IN VITRO AND EX VIVO DELIVERY OF NUCLEIC ACIDS TO CELLS 

1." In vitro Delivery 

Nucleic acid molecules, such as DNA, can be added to synchronized cells in vitro 
either separately or mixed with one or more delivery agents and with or without the 
application of ultrasound or electrical energy. In general, if energy is applied, it is applied 
prior to contacting the cells with the nucleic acid molecule. 

In general, nucleic acid molecules, such as DNA, mixed with cationic 
lipids/compounds can be added to synchronized cells as described in the EXAMPLES. 
Parameters important for optimization of the delivery of nucleic acid molecules, such as 
DNA, into target cells will be apparent to those of skill in this art. These parameters include, 
for example, the cell cycle phase during which nucleic acid molecule is added, the delivery 
method, the nucleic acid molecules, such as DNA, the concentration of nucleic acid 
molecules, the cell growth medium, the cell culture conditions, the length of time cells are 
exposed to the nucleic acid, the toxicity of the delivery method to the target cell type, and the 
amount and time of use of ultrasound or electroporation among other parameters. It may be 
necessary to optimize these parameters for different nucleic acid molecules, such as DNA, 
and target cell types. Such optimization is routine employing the guidance provided herein. 
In addition, the rapid screening method can provide direction as to what parameters may need 
to be adjusted to optimize delivery (see EXAMPLES). Alteration of culture conditions, time, 
reagent concentrations and other parameters, for use with different combinations of cationic 
compounds and target cell types and to optimize delivery, can be empirically determined. If 
ultrasound energy is required to be used to enhance transfection efficiency, it can be applied 
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as described below and in the EXAMPLES. Electroporation can be performed as described 
below or by any suitable protocol known to those of skill in this art. 

The contacting of cells with catioiric compounds and nucleic acid molecules, such as 
DNA, in separate and distinct steps can be generally carried out as described in the 
5 EXAMPLES. Those of skill in the art can readily vary the order of the application of the 
components to the target cell based on the disclosure herein. 

2. Ex vivo Gene Therapy 

Delivery of nucleic acid molecules, such as DNA, is carried out as described above in 
in vitro delivery. After selection has been completed, cells harboring the nucleic acid 
10 molecules, such as DNA, are introduced into the subject target by a variety of means, 

including injection, such as subcutaneous, intramuscular, intraperitoneal, intravascular and 
intralymphatic and intra-articular injection. The cells can be administered with or without the 
aid of medical devices such as arthroscopes, other scopes or various types of catheters. 

3. Gene Therapy in Connective Tissue and Rheumatic Diseases 

1 5 Rheumatoid arthritis (RA) is a chronic inflammatory disease characterized by joint 

inflammation and progressive cartilage and bone destruction. Treatment of RA is 
problematic with current strategies since relatively high systemic doses are necessary to 
achieve therapeutic levels of anti-rheumatic drugs in the joints. In addition, the available 
treatments are associated with significant untoward side effects. Gene therapy is thus a more 

20 efficient system for delivery of therapeutic molecules to the site of inflammation in the 

treatment of connective tissue diseases, rheumatic diseases and chronic erosive joint diseases 
such as RA, osteoarthritis, ankylosing spondylitis and juvenile chronic arthritis. 

In a diarthrodial movable joint, smooth articulation is ensured by the macromolecular 
structure of the articular cartilage which covers the ends of the bones. The cavity or joint 

25 space that occurs at the location of adjacent bones is lined by a tissue referred to as the 

synovium. The synovium contains macrophage-like type A cells (presumably derived from 
macrophage/monocyte precursors and exhibiting phagocytic activity) and fibroblast-like type 
B cells (more fibroblast in appearance and associated with production of hyaluronic acid and 
other components of the joint fluid). Underlying the synovium is a sparsely cellular 

30 subsynovium which may be fibrous, adipose or areolar in nature. Fibroblast-like 

synoviocytes (FLS) are distinguishable from normal fibroblast cells in the subintimal 
synovium by differential gene expression patterns. FLS have been shown to express high 
levels of uridine diphosphoglucose dehydrogenase (UDPGD), high levels of vascular cell 
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adhesion molecule- 1 (VCAM-1), intercellular adhesion molecule- 1 (ICAM-1) as well as 
CD44 (hyaluronic acid receptor), fibronectin receptor and J53 integrins. Sublining fibroblasts 
or fibroblasts from other sources do not express these markers or express them at lower levels 
[ see? e.g., Edwards (1995) Ann. Rheum. Dis. 54:395-397; Firestein (1996) Arthritis Rheum. 
3P:1781-1790; Edwards (2000) Arthritis Res. 2:344-347]. 

Disease progression in RA involves the thickening of the synovial lining due to the 
proliferation of fibroblast-like synoviocytes (FLS) and infiltration by inflammatory cells 
(e.g., lymphocytes, macrophages and mast cells). The normal biology of synoviocytes is also 
altered in the pathological process of RA, including invasion and destruction of articular 
cartilage and bone. In addition to the production of elastase and collagenase, synoviocytes 
mediate the pathophysiological process of RA by expression of cell surface proteins involved 
in the recruitment and activation of lymphocytes and macrophages within the synovium. 
Proliferation of synovial cells leads to a pannus tissue that invades and overgrows cartilage, 
leading to bone destruction and destruction of joint structure and function. Proinflammatory 
cytokines, for example, tumor necrosis factor-a (TNF-a) and interleukin-1 (IL-1) play key 
roles in inflammation and joint damage associated with RA. Pathological effects caused by 
these cytokines include leukocytic infiltration leading to synovial hyperplasia, cell activation, 
cartilage breakdown and inhibition of cartilage matrix synthesis. 

Nucleic acid transfer to rheumatoid synovial tissue may result in the production of 
mediators that inhibit inflammation or hyperplasia or provide toxic substances that 
specifically destroy the diseased synovium. Retroviral delivery of nucleic acid encoding 
interleukin-1 receptor antagonist (IL1-RA) ex vivo and transduction of synoviocytes has been 
used in gene therapy of RA in humans to inhibit inflammation [see, e.g., Evans (1996) 
Human Gen. Ther. 7:1261-1280 and Del Vecchio etal (2001) Arthritis Res. 3:259-263]. 
Adenoviral vectors have been proposed for delivery of nucleic acid encoding an IL-1 receptor 
antagonist to synoviocytes in in vivo transduction methods [see, e.g., U.S. Patent No. 
5,747,072 and PCT Application Publication No. WO 00/52186]. 

Artificial chromosomes provide advantages over virus-based systems for gene 
therapy. For example, artificial chromosome expression systems (ACes), and other artificial 
chromosomes as described in U.S. Patent Nos. 6,025,155 and 6,077,697 and PCT Application 
No. WO97/40183, serve as non-integrating, non-viral vectors with a large capacity for 
delivering large nucleic acids and/or multiple copies of a particular nucleotide sequence into 
cells, such as synoviocytes, both in vitro and in vivo. Such artificial chromosome systems 
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offer further advantages in that they allow stable and predictable expression of genes 
producing single or multiple proteins over long periods of time. 

The methods provided herein may be used to introduce large nucleic acids, such as, 
for example, artificial chromosomes, into primary cells, such as, for example, synoviocytes 
(e.g., fibroblast-like synoviocytes) and skin fibroblasts, and skeletal muscle fibroblast cell 
lines. Thus, included among the methods provided herein is a method for introducing 
heterologous nucleic acid into a synoviocyte by introducing in a chromosome, such as for 
example an artificial chromosome, into the synoviocyte. In one such embodiment, the 
artificial chromosome is an ACes. The synoviocyte can be, for example, a fibroblast-like 
synoviocyte. 

A particular method provided herein for introducing a large nucleic acid molecule into 
a synoviocyte includes steps of obtaining a synoviocyte that is in a pre-selected phase and 
contacting the synoviocyte with the nucleic acid molecule. La one method, the nucleic acid 
molecule is contacted with a synchronous population of synoviocytes in which the population 
has been synchronized to be in a pre-selected phase at the time of contact. 

Such methods can further include steps of exposing the nucleic acid molecule to a 
delivery agent and contacting the synoviocyte with the nucleic acid molecule. In a particular 
embodiment of this method, the delivery agent is not energy. In one embodiment, the large 
nucleic acid molecule is a chromosome. For example, the nucleic acid can be an artificial 
chromosome, such as an ACes. In a particular embodiment, the synoviocyte is a fibroblast- 
like synoviocyte. Any delivery agents, such as described herein, may be used in such 
methods. For example, the delivery agent can be one that includes a cationic compound. 

Also provided are methods for introducing a large nucleic acid molecule into a 
synoviocyte that include a step of obtaining a synoviocyte that is in a pre-selected phase and 
contacting the synoviocyte with the nucleic acid molecule. Such methods can further include 
steps of exposing the nucleic acid molecule to a delivery agent, exposing the synoviocyte to a 
delivery agent and contacting the synoviocyte with the nucleic acid molecule, whereby the 
nucleic acid molecule is delivered into the synoviocyte, and wherein the steps are performed 
sequentially in any order or simultaneously. In some embodiments of the method, if the 
delivery agent is energy, it is not applied to the nucleic acid molecule and it is not applied to 
the synoviocyte after contacting the synoviocyte with the nucleic acid molecule. The nucleic 
acid may be any nucleic acid. In particular embodiments, the nucleic acid is a large nucleic 
acid, chromosome, artificial chromosome or ACes. In a further particular embodiment, the 
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synoviocyte is a fibroblast-like synoviocyte. Delivery agents, such as described herein, may 
be used in such methods. For example, the delivery agent can be one that includes a catiorric 
compound. 

Another method for delivering a nucleic acid molecule into a synoviocyte provided 
herein includes steps of contacting the synoviocyte in the presence or absence of the nucleic 
acid molecule with a delivery agent, and applying ultrasound energy or electrical energy to 
the synoviocyte, wherein the contacting and applying are performed sequentially or 
simultaneously, and then contacting the synoviocyte with the nucleic acid molecule, whereby 
the nucleic acid molecule is delivered into the synoviocyte. The nucleic acid may be any 
nucleic acid. In particular embodiments, the nucleic acid is a large nucleic acid, 
chromosome, artificial chromosome or ACes. In a further particular embodiment, the 
synoviocyte is a fibroblast-like synoviocyte. Numerous delivery agents, including agents 
such as those described herein, may be used in such methods. For example, the delivery 
agent can be one that includes a cationic compound. In one embodiment, the energy is 
ultrasound. 

Thus, provided herein are methods of delivering nucleic acids, in particular, large 
nucleic acids, such as chromosomes, including artificial chromosomes, e.g., ACes, into 
primary cells, including synoviocytes and fibroblasts. These methods may be used in vitro 
and in vivo. 

Also provided herein is a synoviocyte in a pre-selected phase comprising a large 
heterologous nucleic acid, a heterologous chromosome or portion thereof, or an artificial 
chromosome. In one embodiment, the artificial chromosome is an ACes. Such synoviocytes 
include fibroblast-like synoviocytes. The synoviocytes may be from any species, including, 
but not limited to mammalian species. For example, synoviocytes containing large nucleic 
acids, such as for example artificial chromosomes (e.g., ACes) include primate synoviocytes, 
as well as rodent, rabbit, monkey, dog, horse and human synoviocytes. 

The ability to achieve delivery of large nucleic acids into such cells demonstrates the 
usefulness of the methods in gene therapy applications as well as in the testing in animal 
models of disease of possible therapeutic molecules for use in gene therapy methods. Thus, 
provided herein are methods of treating diseases or modulating disease processes which 
include steps of introducing a large nucleic acid molecule, chromosome or portion thereof, or 
artificial chromosome into a subject who has the disease. 
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A method for treating or modulating a rheumatic disease process in a subject is 
provided herein. In one embodiment, the method includes steps of introducing a large 
nucleic acid into the subject, wherein the large nucleic acid contains nucleic acid that is or 
that encodes an agent that modulates a rheumatic disease process. For example, the nucleic 
acid can be or can encode a molecule that has an anti-rheumatic effect. Processes associated 
with rheumatic diseases are known in the art and are described herein. For example, one such 
process is an inflammatory process that includes processes of cell activation, infiltration, 
proliferation and recruitment. In a particular embodiment of this method, the disease is 
rheumatoid arthritis. The nucleic acid may be, for example, a chromosome or portion thereof 
or an artificial chromosome, e.g., an ACes. In particular embodiments, the large nucleic acid 
is introduced into a site of inflammation in the subj ect. One possible site of inflammation is a 
joint 

Also provided is a method for treating a rheumatic disease in a subject in which a 
large nucleic acid is introduced into the subject, wherein the large nucleic acid contains 
nucleic acid that is or that encodes a therapeutic agent. For example, the nucleic acid can be 
or can encode a molecule that has an anti-rheumatic effect. In a particular embodiment of 
this method, the disease is rheumatoid arthritis. The nucleic acid may be, for example, a 
chromosome or portion thereof or an artificial chromosome, e.g., an ACes. In an 
embodiment of this method, the large nucleic acid is introduced into a site of inflammation in 
the subject. One possible site of inflammation is a joint. 

In the methods for modulating a rheumatic disease process or treating a rheumatic 
disease, the method may be practiced in any format, including ex vivo and in vivo formats. 
Thus, for example, the nucleic acid can be introduced into a cell in vitro and then transferred 
into the subject. Alternatively, the nucleic acid can be introduced into a cell in vivo. In a 
particular embodiment, the nucleic acid is introduced into a synoviocyte, which can be, for 
example, a fibroblast-like synoviocyte. The nucleic acid that is introduced can comprise any 
nucleic acid that is or that encodes a molecule that has an anti-rheumatic effect in the subject. 
For example, the molecule may alter, counteract or diminish a process of the disease. The 
molecule may ameliorate symptoms of the disease. Molecules that provide anti-rheumatic 
effects in subjects with RA are known in the art [see, e.g., Vervoordeldonk and Tak (2001) 
Best Prac. Res. Clin. Rheumatol i5:771-788 and WO 00/52186]. Such molecules include 
anti-inflammatory or immunomodulatory molecules. For example, interleukin-1 receptor 
antagonists, soluble interleukin-1 receptor, soluble tumor necrosis factor receptor, interferon- 
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6, interleukin-4, interleukin-10, interleukin-13, transforming growth factor JS, dominant 
negative IkappaB-kinase, FasL, Fas-associated death domain protein or CTLA-4 are among 
molecules that can have anti-rheumatic effects. 

Also provided is a method of identifying, evaluating or testing a nucleic acid as a 
potential therapeutic agent in the treatment of a connective tissue or rheumatic disease by 
introducing into an animal model of a connective tissue or rheumatic disease a large nucleic 
acid molecule. The nucleic acid molecule can be one that includes nucleic acid that is or 
encodes a candidate therapeutic agent. The method may include a step of determining if the 
nucleic acid molecule has any effects, and in particular any anti-rheumatic effects, on the 
animal. For example, in determining if the nucleic acid molecule has any effects on the 
animal, it may be evaluated whether one or more conditions of the disease is effected, such 
as, for example, amelioration of or reduction in an adverse condition. In a particular 
embodiment, the disease is a rheumatic disease, such as, for example rheumatoid arthritis. 
The animal is any animal in which the disease may be modeled. For example, the animal 
may be a mammal. In particular embodiments the animal is a monkey, rodent, rabbit, dog, 
cat, horse, cow, pig or primate. The large nucleic acid may be, for example, a chromosome, 
or portion thereof, or an artificial chromosome, for example, an ACes. In particular 
embodiments, the nucleic acid molecule is in a synoviocyte, such as, for example, a 
fibroblast-like synoviocyte. In further embodiments, the nucleic acid is introduced into a 
joint of the animal. The nucleic acid molecule may be introduced into the animal using in 
vitro or in vivo formats. For example, the nucleic acid can be introduced into a cell in vitro 
and then be transferred into the animal. In another embodiment, the nucleic acid is 
introduced into a cell in vivo. 

Animal models include, for example, animal models of RA. Several animal models 
of RA, and methods for generating such models, are known in the art. Such models include 
adjuvant-induced arthritis (AA) [see, e.g., Kong et al (1999) Nature 4023:304-309] and 
collagen type II-induced arthritis [see, e.g., Tak et al (1999) Rheumatology 35:362-369; Han 
etal (1998) Autoimmunity 2<?:197-208; Gerlagera/. (2000) J. Immunology 755:1652-1658]. 
For example, experimental induction of adjuvant-induced arthritis in Lewis rats leads to 
severe inflammation in the bone marrow and soft tissues surrounding joints accompanied by 
extensive local bone and cartilage destruction, loss of bone mineral density and crippling 
[see, e.g., Bendele etal (1999) Arthritis Rheum. 42:498-506]. 
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The following examples are included for illustrative purposes only and are not 
intended to limit the scope of the invention. 

EXAMPLE 1 
Preparation of Artificial Chromosomes 
A. GFP Chromosome contained in A9 cell line Plasmids 

Plasmid pJJRES-EGFP (see SEQ ID No. 13, plasmid obtained from Clontech, CA, and 
is well known, see, e.g., U.S. Patent Nos. 6,034,228, 6,037,133, 5,985,577, 5,976,849, 
5,965,396, 5,976,796, 5,843,884, 5,962,265, 5,965,396; see, also, U.S. Patent No. 4,937,190). 
This plasmid contains the internal ribosome entry site (IRES; Jackson (1990) Trends 
Biochem. 75:477-483; Jang et al. (1988)./. Virol. (52:2636-2643) of the encephalomyocarditis 
virus (ECMV) between the MCS and the enhanced green fluorescent protein (EGFP) coding 
region. This permits the gene of interest (cloned into the MCS) and the EGFP gene to be 
translated from a single bicistronic mRNA transcript. Plasmid pIRES2-EGFP is designed for 
selection, by flow cytometry and other methods, of transiently transfected mammalian cells 
that express EGFP and the protein of interest. This vector can also be used to express EGFP 
alone or to obtain stably transfected cell lines without drug and clonal selection. 

Enhanced GFP (EGFP) is a mutant of GFP with a 35-fold increase in fluorescence. 
This variant has mutations of Ser to Thr at amino acid 65 and Phe to Leu at position 64 and is 
encoded by a gene with optimized human codons (see, e.g., U.S. Patent No. 6,054,3 12). 
EGFP is a red-shifted variant of wild-type GFP (Yang et al. (1996) Nucl. Acids Res. 24:4592- 
4593; Kaasetal. (1996) Curr. Biol. 5:315-324; Jackson et al. (1990) Trends Biochem. 
75:477-483) that has been optimized for brighter fluorescence and higher expression in 
mammalian cells (excitation maximum = 488 nm; emission maximum = 507 nm). EGFP 
encodes the GFPmutl variant (Jackson (1990) Trends Biochem. 75:477-483) which contains 
the double-amino-acid substitution of Phe-64 to Leu and Ser-65 to Thr. The coding sequence 
of the EGFP gene contains more than 190 silent base changes which correspond to human 
codon-usage preferences (Jang et al. (1988) J. Virol. 52:2636-2643). Sequences flanking 
EGFP have been converted to a Kozalc consensus translation initiation site (Huang et al. 
(1990) Nucleic Acids Res. 18: 937-947) to further increase the translation efficiency in 
eukaryotic cells. 

Plasmid pIRES-EGFP was dervied from PIRESneo (orignally called pCIN4) by 
replacing the neo gene downstream of the IRES sequence with the EGFP coding region. The 
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IRES sequence permits translation of two open reading frames from one mRNA transcript. 
The expression cassette of pIRES-EGFP contains the human cytomegalovirus (CMV) major 
immediate early promoter/enhancer followed by a multiple cloning site (MCS), a synthetic 
intron (TVS; Huang et al (1990) Nucleic Acids Res. 18: 937-947), the EMCV IRES followed 
by the EGFP coding region and the polyadenylation signal of bovine growth hormone. 

Location of Features (with reference to SEQ ID No. 13): 

Human cytomegalovirus (CMV) immediate early promoter: 232-820; 

MCS 909-974; 

IVS 974-1269; 

IRES of ECMV 1299-1884; 

Enhanced green fluorescent protein (EGFP) gene 1905-2621; 
fragment containing the bovine polyA signal 2636-2913; 
Col El origin of replication 3343-4016; and 
Ampicillin resistance gene 5026-4168 

Propagation in E. coli 

Suitable host strains: DH5a, HB101, and other general purpose strains. Single-stranded DNA 
production requires a host containing an F plasmid such as JM101 or XLl-Blue. 
Selectable marker: plasmid confers resistance to kanamycin (30 H-g/ml) to E. coli hosts. 
E. coli replication origin: pUC 
Copy number: -500 

Plasmid incompatibility group: pMBl/ColEl 
pCHEGFP2 

Plasmid pCHEGFP2 was constructed by deletion of the Nsil/Smal fragment from 
pIRES-EGFP. Plasmid pIRES-EGFP contains the coding sequence for a 2.1 kB Nru 1/Xho 
fragment of pCHEGFP2 containing the CMV promoter, synthetic intron, EGFP coding 
sequence and bovine growth hormone polyadenylation signal. Digestion of pIRES-EGFP 
with Nru 1 and Sma 1, yielded a 2.1 kb fragment. Digested DNA was fractionated by agarose 
gel electrophoresis, the separated band was excised and then eluted from the gel using the 
Qiaex 1 1 gel purification system (Qiagen, Mississauga, Ontario). 
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pFK161 

Cosmid pFK161 was obtained from Dr. Gyula Hadlaczky and contains a 9 kb Notl 
insert derived from a murine rDNA repeat (see clone 161 described in PCT Application 
Publication No. WO97/40183 by Hadlaczky etal. for a description of this cosmid). This 
cosmid, referred to as clone 161 contains sequence corresponding to nucleotides 10,232- 
15,000 in SEQ ED NO. 16. It was produced by inserting fragments of the megachromosome 
(see, U.S. Patent No. 6,077,697 and International PCT application No. (WO 97/40183); for 
example, H1D3, which was deposited at the European Collection of Animal Cell Culture 
(ECACC) under Accession No. 96040929, is a mouse-hamster hybrid cell line carrying this 
megachromosome) into plasmid pWE15 (Stratagene, La Jolla, California) as follows. Half of 
a 100 ul low melting point agarose block (mega-plug) containing isolated SATACs was 
digested with Notl overnight at 37°C. Plasmid pWE15 was similarly digested with iVbfl 
overnight. The mega-plug was then melted and mixed with the digested plasmid, ligation 
buffer and T4 ligase. Ligation was conducted at 16°C overnight. Bacterial DH5a cells were 
transformed with the ligation product and transformed cells were plated onto LB/Amp plates. 
Fifteen to twenty colonies were grown on each plate for a total of 1 89 colonies. Plasmid 
DNA was isolated from colonies that survived growth on LB/Amp medium and was 
analyzed by Southern blot hybridization for the presence of DNA that hybridized to a pUC19 
probe. This screening methodology assured that all clones, even clones lacking an insert but 
yet containing the pWE15 plasmid, would be detected. 

Liquid cultures of all 189 transformants were used to generate cosmid rninipreps for 
analysis of restriction sites within the insert DNA. Six of the original 189 cosmid clones 
contained an insert. These clones were designated as follows: 28 (~9-kb insert), 30 (~ 9-kb 
insert), 60 (~4-kb insert), 1 13 (~9-kb insert), 157 (~9-kb insert) and 161 (~9-kb insert). 
Restriction enzyme analysis indicated that three of the clones (113, 157 and 161) contained 
the same insert. 

For sequence analysis the insert of cosmid clone no. 161 was subcloned as follows. 
To obtain the end fragments of the insert of clone no. 161, the clone was digested with Notl 
and BamW and ligated with iVbfl/SamHI-digested pBluescript KS (Stratagene, La Jolla, 
California). Two fragments of the insert of clone no. 161 were obtained: a 0.2-kb and a 0.7- 
kb insert fragment. To subclone the internal fragment of the insert of clone no. 161, the same 
digest was ligated with tfamHI-digested pUC19. Three fragments of the insert of clone no. 
161 were obtained: a 0.6-kb, a 1.8-kb and a 4.8-kb insert fragment. 
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The insert corresponds to an internal section of the mouse ribosomal RNA gene 
(rDNA) repeat unit between positions 7551-15670 as set forth in GENBANK accession no. 
X82564, which is provided as SEQ ID NO. 5. The sequence data obtained for the insert of 
clone no. 161 is set forth in SEQ ID NOS. 6-12. Specifically, the individual subclones 
5 corresponded to the following positions in GENBANK accession no. X82564 (i.e., SEQ ID 
NO. 5) and in SEQ ID NOs. 6-12: 



Sublclone 


Start | End 


Site 


SEQ ID No. 




in X82564 






161kl 


7579 


7755 


Notl, BamHL 


6 


161m5 


7756 


8494 


BamHL 


7 


161m7 


8495 


10231 


BamYSl 


8(shows only sequence corresponding to nt 495- 
8950), 9 (shows only sequence corresponding to 
nt.9851- 10231) 


161ml2 


10232 


15000 


BainW 


10 (shows only sequence corresponding to nt 
10232-10600), 1 1 (shows only sequence 
corresponding to nt. 14267-15000) 


161R2 


15001 


15676 


Notl, BamHl 


12 



The sequence set forth in SEQ ID NOs. 6-12 diverges in some positions from the 

1 0 sequence presented in positions 7551-1 5670 of GENBANK accession no. X82564. Such 
divergence may be attributable to random mutations between repeat units of rDNA. 

For use herein, the rDNA insert from the clone was prepared by digesting the cosmid 
with Notl and BgDl and was purified as described above. Growth and maintenance of 
bacterial stocks and purification of plasmids were performed using standard well known 

1 5 methods (see, e.g., Sambrook et al. (1 989) Molecular Cloning: A Laboratory Manual, 2 nd 
Edition, Cold Spring Harbor Laboratory Press), and plasmids were purified from bacterial 
cultures using Midi - and Maxi-preps Kits (Qiagen, Mississauga, Ontario). 
B. PREPARATION OF THE GFP, MURINE A9 CELL LINE 
Cell culture and transfection 

20 The murine A9 cell line was obtained from ATCC and cells were thawed and 

maintained as described below. Briefly, cells were plated at a density of 2 X 10 6 cells per 15 
cm tissue culture dish (Falcon, Becton Dickinson Labware, Franklin Lakes, NJ) in growth 
medium containing of 90% DMEM (Canadian Life Technologies Burlington, ON) and 10% 
FBS (Can Sera, Rexdale ON), and were maintained at 37°C, 5% C0 2 . Cultures were 

25 routinely passaged when cells reached 70%-80% confluence. Sub culturing was carried out 
as follows: medium was removed by aspiration, 10 ml of IX trypsin-EDTA (Canadian Life 
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Technologies Burlington, ON) was dispensed onto the cell monolayer and the dish gently 
swirled to distribute the trypsin-EDTA. Finally, the bulk of the trypsin-EDTA was removed 
by aspiration, and the dish placed at 37°C for 5 minutes. To quench the trypsin-EDTA, 10 ml 
of growth medium was added to the dish, and the single cell suspension was transferred to a 
50 ml conical tube. Cell counts were performed using a cell counting apparatus (Beckman- 
Coulter, Hialeah FL). The cells were diluted and re-plated as described above. For cryo- 
storage, cultures were harvested by treatment with trypsin-EDTA, counted and the cell 
suspension then centrifuged at 500Xg for 5 minutes in a swinging bucket centrifuge. The cell 
pellet was resuspended in freezing medium containing 90% DMEM, 20% FBS and 10% 
DMSO (Sigma-Aldrich, Oakville, ON) at a density of 1 X 1 0 7 cells/ml. One ml aliquots of 
the cell suspension were then dispensed into cryo-vials (Nunc, Rochester NY), frozen over 
night in an isopropanol filled container (NUNC, Rochester NY) and placed at -70°C and then 
transferred to the gas phase of a liquid nitrogen freezer for long-term storage. 

A9 cells were transfected using the Ca 2 P0 4 co-precipitation method (see, e.g., 
Graham et al. (1978) Virology 52:456-457; Wigler et al. (1979) Proc. Natl. Acad. Sci. U.S.A. 
75:1373-1376; and (1990) Current Protocols in Molecular Biology, Vol. 1, Wiley Inter- 
Science, Supplement 14, Unit 9.1.1-9.1.9). One day prior to transfection, A9 cells were 
plated at a density of 2 x 10 6 cells per 10 cm dish and 3 hours before transfection the medium 
was replaced with fresh growth medium. 140 ug of the 9 kb rDNA, Notl and 5 ug of the 2.1 
kB CMV-EGFP Xhol/Nrul fragments were mixed, co-precipitated and used to prepare the 
Ca 2 P0 4 co-precipitate (Calcium Phosphate Transfection System, (Canadian Life 
Technologies Burlington, ON) which was distributed onto 2 10-cm dishes of subconfluent A9 
cells. The DNA-Ca2P04 complexes were left on the cells for 18 hours, after which the 
precipitate was removed by aspiration and cells were subjected to glycerol shock for 1.5 
minutes. After glycerol shock, the cell monolayers were gently washed with 2 X 10 ml of 
dPBS (Canadian Life Technologies Burlington, ON), followed by addition of 10 ml pre- 
warmed growth medium. Finally dishes were returned to the incubator and were maintained 
at 37°C, 5% C0 2 . After 3 hours recovery, each dish was passaged onto 3X 15 cm tissue 
dishes 

GFP fluorescence of cultures was monitored visually during culture using an inverted 
microscope equipped with epifluorescence illumination (Axiovert 25; Zeiss, (North York 
ON) and #41017 Endow GFP filter set (Chroma Technologies, Brattleboro, VT). Enrichment 
of GFP expressing populations was carried out as described below. 
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Enrichment of GFP expressing cell populations by 
Fluorescence Activated Cell Sorting 

Cell sorting was carried out using a FACS Vantage flow cytometer (Becton Dickinson 
Immunocytometry Systems, San Jose, CA) equipped with turbo-sort option and 2 Innova 306 
lasers (Coherent, Palo Alto CA). For cell sorting a 70 urn nozzle was used. The sheath 
buffer was changed to PBS (maintained at 20 p.s.i.). GFP was excited with a 488 nm Laser 
beam and excitation detected in FL1 using a 500 EFLP filter. Forward and side scattering was 
adjusted to select for viable cells. Only viable cells were then analyzed for GFP 
fluorescence. Gating parameters were adjusted using wild type A9 cells as negative control 
and GFP CHO cells as positive control. 

For the first round of sorting, A9 cells were harvested 4 days post-transfection, 
resuspended in 10 ml of growth medium and sorted for GFP expressing populations using 
parameters described above. GFP positive cells were dispensed into a volume of 5-10 ml of 
growth medium supplemented with IX penicillin/streptomycin (Canadian Life Technologies 
Burlington, ON) while non-expressing cells were directed to waste. The expressing cells 
were further diluted to 50 ml using the same medium, plated onto 2X15 cm dishes and 
cultured as described in the previous section. When the sorted populations reached 
confluence they were resorted to enrich for GFP expressing cells. A total of 4 sequential sorts 
were carried out, achieving enrichments of as high as 89% GFP expressing cells after the 
final sort. The final GFP expressing populations were expanded for cryo-preservation and for 
fluorescence in-situ hybridization screening (see below). Single cell clones were established 
from populations of interest by using the flow cytometer to direct GFP expressing single cells 
to individual wells of 96 well plates. These were cultured as described above. 
Fluorescence In-Situ Hybridization 

Fluorescence In-Situ Hybridization (FISH) screening was carried out on GFP 
enriched populations and single cell clones to detect amplification and/or artificial 
chromosome formation. Preparation of metaphase spreads and hybridizations were 
performed (see, Telenius et al. (1999) Chromosome Res 7:3-7). Probes used include pSAT 1, 
which recognizes the mouse major repeat (see, e.g., Wong et al. (1988) Nucl. Acids Res. 
16:1 1645-1 1661), pFK161, which hybridizes to the mouse rDNA-containing regions and a 
PCR generated probe against the mouse minor repeat. 

Thus, in one method provided herein for generating an artificial chromosome, such as 
an ACes, heterologous nucleic acid that includes a selectable marker, e.g., nucleic acid 
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encoding a fluorescent protein or other protein that may be readily detected using flow 
cytometry-based methods or other methods, including, for example, fluorimetry, cell imaging 
or fluorescence spectroscopy, is introduced into a cell. For example, rDNA and DNA 
encoding enhanced green fluorescent protein (EGFP) may be introduced into cells, e.g., A9 
cells. The transfected cells may be selected on the basis of properties detectable by flow 
cytometry-based methods, or other methods, including, for example, fluorimetry, cell 
imaging or fluorescence spectroscopy, e.g., fluorescent properties. For example, cells 
containing a fluorescent protein may be isolated from nontransfected cells using a 
fluorescence-activated cell sorter (FACS). If the sorting is conducted prior to chromosomal 
analysis of the cells for the presence of artificial chromosomes, it provides a population of 
transfected cells that may be enriched for artificial chromosomes and thus facilitates any 
subsequent chromosomal analysis of the cells and identification and selection of cells 
containing an artificial chromosome, e.g., ACes. For example, the cells may be analyzed for 
indications of amplification of chromosomal segments, the presence of structures that may 
arise in connection with amplification and de novo artificial chromosome formation and/or 
the presence of artificial chromosomes, such as ACes. Analysis of the cells typically 
involves methods of visualizing chromosome structure, including, but not limited to, G- and 
C-banding and FISH analyses using techniques described herein and/or known to those of 
skill in the art. Such analyses can employ specific labelling of particular nucleic acids, such 
as sateUite DNA sequences, heterochromatin, rDNA sequences and heterologous nucleic acid 
sequences, that may be subject to amplification. During analysis of transfected cells, a 
change in chromosome number and/or the appearance of distinctive, for example, by 
increased segmentation arising from amplification of repeat units, chromosomal structures 
will also assist in identification of cells containing artificial chromosomes. 
C. Purification of artificial chromosomes by Flow Cytometry and preparation of 
DNA from flow sorted chromosomes 

Artificial chromosomes were purified from the host cell by flow cytometry (see de 
Jong (1999) Cytometry 35:129-133). Briefly, purification was performed on FACS Vantage 
flow cytometer (Becton Dickinson Immunocytometry Systems, San Jose, CA) equipped with 
a Trubo-Soit Option and two Innova 306 lasers (Coherent, Palo Alto, CA). The Turbo Sort 
Option modification include increasing the maximum system pressure from 20 lb/in 2 to 60 
lb/in 2 , increasing the drop drive frequency from 50,000 drops/s to a maximum of 99,000 
drops/s and increasing the deflection plate voltages from a maximum 6,000 V to 8,000 V. 
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Other modifications are made to the instrument to accommodate the higher pressures. 
Hoechst 35258 was excited with the primary UV laser beam, and excitation detected in FLI 
by using 420 nm hand-pass niter. Chromomycin A3 was excited by the second laser set at 
458 nm and fluorescence detected in FL 4 by using a 475 nm long-pass filter. Both lasers 
had an output of 200 mW. Bivariate distributions (1,024 x 1024 channels) were accumulated 
during each sort. For all chromosome sorts, the sheath pressure was set at 30 lb/in2 and a 50 
urn diameter nozzle was installed. A drop delay profile was performed every morning and 
repeated after any major plug. Alignment of the instrument was performed daily by using 3 .0 
um diameter Sphero rainbow beads (Spherotech, Libertyville, IL). Alignment was 
considered optimized when a CV of 2.0% or less was achieved for FLI and FL4. 

Condensing agents (hexylene glycol, spermine and spermidine) were added to the 
sheath buffer to maintain condensed chromosomes after sorting. The sheath buffer contains 
15 nM Tris HC1, 0.1 mM EDTA, 20 mM NaCl, 1% hexylene glycol, 100 mM glycine, 20 uM 
spermine and 50 uM spermidine. The sorted chromosomes were collected in 1.5 ml screw- 
capped Eppendorf tubes at 4° C at a concentration of approximately 1 x 10 6 
chromosomes/ml, which were then stored at 4° C. 

For preparation of purified genomic DNA, sorted chromosome samples were brought 
to 0.5% SDS, 50 mm EDTA and 100 ng/ml Proteinase K, then incubated for 18 hours at 
50°C. 1 ul of a 20 mg/ml glycogen solution (Boehringer Mannheim) was added to each 
sample, followed by extraction with an equal volume of Phenol: Chloroform: Isoamyl 
Alcohol (25:24:1). After centrifugation at 21,0OOXg for 10 min, the aqueous phases were 
transferred to fresh microfuge tubes and were re-extracted as above. 0.2 volumes of 10 M 
NH40AC, 1 ul of 20 mg/ml glycogen and 1 volume of iso-propanol were added to the twice 
extracted aqueous phases which were then vortexed and centrifuged for 1 5 minutes at 
3O,000Xg (at room temperature). Pellets were washed with 200 ul of 70% ethanol and re- 
centrifuged as above. The washed pellets were air-dried then resuspended in 5mM Tris-Cl, 
pH 8.0 at 0.5-2 X 10 6 chromosome equivalents/ul. 

PCR was carried out on DNA prepared from sorted chromosome samples essentially 
as described (see, Co etal. (2000) Chromosome Research 5:183-191) using primers sets 
specific for EGFP and RAPSYN. Briefly, 50 pi PCR reactions were carried out on genomic 
DNA equivalent to 10,000 or 1000 chromosomes in a solution containing 10 mM Tris-Cl, pH 
8.3, 50mM KC1, 200 uM dNTPs, 500 nM of forward and reverse primers, 1.5 mM MgCb, 
1.25 units Taq polymerase (Ampli-Taq, Perkin-Elmer Cetus, CA). Separate reactions were 
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carried out for each primer set. The reaction conditions were as follows: one cycle of 10 min. 

at 95°C, then 35 cycles of 1 min. at 94°C, 1 min. at 55°C, 1 min at 72°C, and finally one 

cycle of 1 0 min at 72°C. After completion the samples were held at 4°C until analyzed by 

agarose gel electrophoresis using the following primers (SEQ ID Nos. 1-4, respectively): 

EGFP forward primer 5 '-cgtccaggagcgcaccatcttctt-3 

EGFP reverse primer 3 '-atcgcgcttctcgttggggtcttt-3 

RAPSYN forward primer 5 '-aggactgggtggcttccaactcccagacac-3 *; and 

RAPSYN reverse primer 5 £ -agcttctcattgctgcgcgccaggttcagg-3 '. 

All primers were obtained from Canadian Life Technologies, Burlington, ON. 

EXAMPLE 2 

Preparation of Cationic vesicles 

Vesicles were prepared at a lipid concentration of 700 nmoles/ml lipid (cationic 
lipid/DOPE 1:1) as follows. In a glass tube (10ml) 350 nmol cationic lipid (SAINT-2) was 
mixed with 350 nmol dioleoyl-phosphatidylethanolamine (DOPE), both solubilized in an 
organic solvent (Chloroform, Methanol or Chloroform/Methanol 1:1, v/v). 
Dioleoylphosphatidylethanolamine (DOPE; Avanti Polar Lipids, Alabaster, AL) forms 
inverse hexagonal phases in a membrane and weakens the membrane. Other effectors that 
may be used are cis-unsaturated phosphatidylemanolamines, cis-unsaturated fatty acids and 
cholesterol. Cis-unsaturated phosphatidylcholines are less effective. 

The solvent was evaporated under a stream of nitrogen (15 rnin/250 ul solvent at 
room temperature). The remaining solvent was removed totally by drying the lipid for 1 5 
min in an desiccator under high vacuum from a vacuum pump. To the dried mixture was 
added 1 ml ultrapure water. This was vortexed vigorously for about 5 min. The resulting 
solution was sonicated in an ultrasonication bath (Laboratory Supplies Inc. NY) until a clear 
solution was obtained. The resulting suspension contained a population of unilamellar 
vesicles with a size distribution between 50 to 100 nm. 



EXAMPLE 3 

Preparation of Cationic vesicles via alcoholic injection 

In a glass tube (10ml) 350 nmol cationic lipid (Saint-2) was mixed with 350 nmol 
DOPE, both solubilized in an organic solvent (chloroform, methanol or chloroform/methanol 
1/1). The solvent was evaporated under a stream of nitrogen (15 min/ 250 ul solvent at room 
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temperature). The remaining solvent was removed totally by drying the lipid for 15 min 
under high vacuum. This was then reconstituted in 100 ul pure ethanol. 

EXAMPLE 4 
Transfection of beta ACes into V79-4 cell line 

Transfection procedure for various transfection agents 

All compounds were tested in a Chinese Hamster lung fibroblast line (V79-4, ATCC 
number CCL-39). Approximately 17 hours (2 cell doublings) prior to transfection, 
exponentially growing cells were trypsinized and plated at 250,000 cells per well into a 6 
well petri dish with Dulbeccobs Modified Eagle Medium (Life Technologies, Burlington, 
ON) and supplemented with 10% FBS (Can Sera, Rexdale ON))- At the time of transfection, 
the number of cells per well was estimated to be approximately 1 million. For transfection, 
each individual manufacturers protocol for complexing to naked DNA was followed, with 
the exception that the amount of transfection agent used was varied, to reflect the different 
amount and type of DNA present, as well as the different ionic strength of the complexing. 
One million ACes (in a volume of 800 ul) were typically combined with the transfection 
agent in a wide range of concentrations (between 5 times and 100 times the lowest 
manufacturers suggested concentration). The ACes/transfection mixture was allowed to 
complex for the time recommended by the manufacturer, in volumes ranging from 0.8 ml to 
1 .9 ml; some manufacturers recommend adding media to the complexing reaction. The 
complexed mixture was then applied to the recipient cells and transfection allowed to proceed 
according to the manufacturers protocol. Details on the various conditions used with 
different agents are presented in Table 1 . 

Transfection procedure for Superfect agent 

Superfect was tested in a Chinese Hamster lung fibroblast line (V79-4, ATCC number 
CCL-39). Approximately 17 hours (2 cell doublings) prior to transfection, exponentially 
growing cells were trypsinized and plated at 250,000 cells per well into a 6 well petri dish 
with Dulbeccobs Modified Eagle Medium (Life Technologies, Burlington, ON) and 
supplemented with 10% FBS (Can Sera, Rexdale ON). One million ACes in 800ul of sort 
buffer was complexed to lOul of Superfect reagent. Complex was incubated at room 
temperature for 10 minutes. At the time of transfection, the number of cells per well was 
estimated to be approximately 1 million. Media was removed from wells and 600ul of 
DMEM and 10% FBS was added. SuperfectACes complex was added to the weUs drop- 
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wise and allowed to incubate for 3 hours at 37°C. After incubation, transfected cells were 
trypsinized and transferred to 15 cm dishes with 25 ml DMEM and 10% FBS and allowed to 
attach for 24 hours. After 24 hours, selection medium containing of 0.7 mg/ml hygromycin B 
was added to each well. The selection medium was changed every 2-3 days. After 10-12 
days colonies were screened for Beta-galactosidase expression and/or FISHed for detection 
of intact chromosome. 

Example of application of the determination of the Chromos Index 
Approximately 1 x 106 V79-4 cells were transfected with 1 x 106 IdUrd-labeled 
ACes complexed with a delivery agent (i.e., Lipofectamine PLUS and Lipofectamine or 
Superfect). The transfected cells were then fixed in ethanol. Fixed cells were denatured and 
exposed to FITC-conjugated antibody that specifically binds to BrdU/IdUrd-labeled nucleic 
acids. 

The percentage of transfected cells containing IdUrd-labeled ACes was determined 
using flow cytometry and collecting FITC fluorescence. Data were accumulated to form 
bivariate channel distribution showing forward scatter versus green fluorescence (IdUrd- 
FITC). The fluorescence level at which cells were determined to be positive was established 
by visual inspection of the histogram of negative control cells such that the gate for the 
negative cells was set such that 1% appeared in the positive region. 

The number of cells recovered at 24 hours post-transfection was determined by 
counting an aliquot using a Coulter Counter. To determine the control plating efficiency of a 
recipient cell line, the untreated cells were plated at 600-1000 cells per 10 cm petri dish in 
growth medium and left stationary in a 5% C02 incubator at 37°C for approximately five cell 
cycles or until average colony was made up of 50 cells. At this point the number of viable 
colonies was determined. The treated cells were seeded at 1000 cells if the CPE is above 0.1- 
0.2. If the CPE is low then the seeding density is increased to 5,000-50,000 cells per dish. 

EXAMPLE 5 

Ultrasound mediated transfection of LMTKQ cells with Lipofectamine 

LM(tk>) cells were grown at 37°C, 5% C02, in DMEM with 4500 mg/L D-glucose, 
L-glutamine, pyridoxine hydrochloride and 10% Fetal Bovine Serum. The comer wells of a 
12- well dish were seeded with 200,000 cells per well (this is to ensure no interference from 
the ultrasound waves from other wells) 24 hours before use. 
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The GFP chromosomes were counted to verify approximately 1X106 ACes per ml. 
The chromosomes were resuspended in the tube by flicking. Ten ul of chromosome 
suspension was removed and mixed with an equal volume of 30 mg/ml PI (propidium iodide) 
stain. Eight ul of the stained chromosomes was loaded onto a Petroff Hausser counting 
chamber and the chromosomes were counted. 

The medium was removed from the cells, and the cells were washed twice with HBSS 
(without phenol red, Gibco BRL) warmed to 37°C. 500 ul of the warmed HBSS was added 
to each well of cells (1 ul) LipofectAMINE (Gibco BRL) was added to each well. The plates 
were then sealed with parafilm tape and shaken gently at 20 rpm at room temperature for 30 
minutes (Stagger plates - 10 minutes for ease of handling). 

After incubation Ultrasound gel (Other-Sonic Generic Ultra sound transmission gel, 
Pharmaceutical Innovations, Inc., Newark, NJ) was applied to the 2.5 cm sonoporator head. 
Ultrasound was applied with an ImaRX Sonoporator 100 at an output energy of 2.0 
Watt/cm2, for 60 seconds, through the bottom of the plate of cells. After ultrasound of the 
well one chromosome per seeded cell (2X105) or 200 ul GFP ACes in sheath buffer (15 nM 
Tris HC1, 0.1 mM EDTA, 20 mMNaCl, 1% hexylene glycol, 100 mM glycine, 20 uM 
spermine and 50 uM spermidine) are added immediately to the well. (Repeat until all 
samples on the plate requiring ultrasound have been treated). The plate was then sealed once 
more with parafilm tape and shaken gently (20 rpm) for 1 hour at room temperature. 

After the incubation 1 ml (DMEM with 4500 mg/L D-glucose, L-glutamine and 
pyridoxine hydrochloride, 10% Fetal Bovine Serum, and a lx solution of penicillin and 
streptomycin from a 10000 units/ml penicillin and 10000 mg/ml Streptomycin, lOOx stock 
solution) was added to each well and the cells were incubated 18-24 hours at 37°C. 

The cells in the plates were then washed with antibiotic containing medium and 2 ml 
of medium was placed in each well. The cells continued to be incubated at 37°C with 5% 
C02 until 48 hours after transfection/sonoporation. The cells were then trypsinised and 
resuspended at a concentration of lx 106 in DMEM to be analyzed by flow cytometry. 

Results: Flow analysis was performed on a FACS Vantage (BDIS, San Jose, CA) 
equipped with a turbo-sort option and two Inova 305 lasers (Coherent, Palo Alto, CA). The 
GFP signal excitation is at 488 nm and the emission detected in FL1 using a 500nm long pass 
filter. Analysis of the transfected cells generated populations of GFP positive cells ranging 
from 13-27%. Non-sonoporated control value was 5%. 
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EXAMPLE 6 
Ultrasound mediated transfection with Saint-2 

A. Ultrasound mediated transfection of CHO-KI cells withSaint-2 

CHO-KI cells were grown at 37°, 5% C02, in CHO-S-SFM 2 Medium, (Gibco BRL, 
Paisley, UK). Between 2 x 105 and 5 x 105 cells were plated onto sterile glass slides in a 12 
well plate 24 h before usage. 

Transfection of the cells was performed as follows. The medium was removed from 
the cells, and the cells were washed twice with HBSS (Hanks balanced salt solution without 
Phenol Red (Gibco BRL, UK)) at 37°C. Then 500ul HBSS at 37°C was added per well, 
followed by 10 ul of the freshly prepared vesicle solution (prepared in Example 2) to yield a 
final concentration of 23.3 nmol/ml. 

Alternatively, the medium was removed from the cells, and the cells were washed 
twice with HBSS. 500 ul HBSS/lipid solution at 37°C was added to each well. The 
HBSS/lipid solution was prepared by adding 1 ul ethanolic lipid solution (prepared as 
described above) to 500 ul HBSS under vigorous vortexing. The plates were then sealed 
with parafilm tape and shaken gently at room temperature for 30 min. After incubation, 
ultrasound was applied at an output energy of 0.5 Watt/cm2 for 60 sec through the bottom of 
the plate to the cells. The ultrasound was mediated by an ultrasound gel (Aquasonic 100, 
Parker, NJ) between transducer and plate. The ultrasound was applied with an ImaRx 
Sonoporator 100. Immediately after applying ultrasound one GFP chromosome per seeded 
cell (2 x 105 - 5 x 105) (prepared in Example 1) was added. The plate was then sealed again 
and shaken gently for 1 h at room temperature. After the incubation 1ml medium (CHO-S- 
SFM 2 with 10% Fetal Calf Serum, 10000 ug/ml Penicillin and 10000 bg/ml Streptomycin 
Gibco BRL, Paisley, UK) was added to each well and the cells were incubated for 24 h at 
37°C. The cells were then washed with medium and 1 ml medium was added and the cells 
were incubated at 37° for another 24 h. Detection of expressed genes was then assayed by 
microscopy or detection of the transferred chromosome by FISH analysis. The negative 
control was performed in the same way, but with no chromosomes added to the cells. 
Results 

After transfection, using visual inspection, 30% of the cells remained on the glass 
slide of which 10% were positive for green fluorescent protein expression after 48 hours (3% 
of original population). After culturing for two weeks, FISH was performed on the cells and 
1.4% of the cells contained an intact artificial chromosome. 
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B. Ultrasound mediated transfection of Hep-G2 cells with Saint-2 

Hep-G2 cells were grown at 37°C, 5% C02, in DMEM with 4500 mg/1 Glucose, with 
Pyridoxine/HCL, 10% Fetal Calf Serum, 10000 pg/ml Streptomycin and 1000 pg/ml 
Penicillin. Between 2 x 105 and 5 x 105 cells were plated onto sterile glass slides in a 12 
wells plate 24 hours before usage. 

Cells were transfected with GFP chromosomes using the procedure of Example 6 A 
except that the CHO-KI medium was replaced with Hep-G2 medium. 

Results 

After transfection, 30% of the cells remained on the glass slide. 80% of these cells 
were positive for green fluorescent protein expression. 

C. Ultrasound mediated transfection of A9 cells with Saint-2 

A9 cells were grown at 37°C, 5% C02, in DMEM with 4500 mg/1 Glucose, with 
Pyridoxine/HCL, 10% Fetal Calf Serum, 10000 ug/ml Streptomycin and 10000 ug/ml 
Penicillin (GIBCO BRL, Paisley, UK). Between 2 x 105 and 5 x 105 cells were plated onto 
sterile glass slides in a 12 well plate 24 h before usage. 

Cells were transfected with GFP chromosomes using the procedure of Example 6 A 
except that CHO-KI medium was replaced with A9 medium. 

Results 

After transfection, 30% of the cells remained on the glass of which 50% were positive 
for green fluorescent protein expression. 

EXAMPLE 7 

Delivery of ACes into synoviocytes, skeletal muscle fibroblasts and skin fibroblasts 

A mammalian (murine) ACes artificial chromosome (-60 Mb) containing primarily 
murine pericentric heterochromatin, and including a reporter gene (lacZ) and ahygromycin B 
selectable marker gene, prepared as described in U.S. Patent Nos. 6,025,155 and 6,077,697 
and PCT Application Publication No. WO 97/40183 was delivered into primary rat 
fibroblast-like synoviocytes, rat skin fibroblasts and a rat skeletal muscle fibroblast cell line 
(L8 cells; ATCC Accession No. CRL-1769). Prior to delivery, the ACes were labelled with 
iododeoxyuridine (IdUrd) as described herein. 

Preparation of cells 

Primary fibroblast-like synoviocytes and rat skin fibroblasts were obtained from rats 
using standard methods [see, e.g., Aupperle et al. (1999) J. Immunol. 753:427-433 and 
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Alvaro-Garcia et al. (1990) J. Clin. Invest. 55:1790]. Such methods include isolation of 
synoviocytes from rodent knees generally by removal of skin and muscle, followed by 
mincing of knee joint tissue. The minced tissue is then incubated with collagenase, filtered 
through nylon mesh and washed extensively. Cells can be cultured overnight, after which 
time non-adherent cells are removed. Adherent cells can be cultured and passaged by 
replating at a dilution when the cultures reach confluence. The cells were plated at 50,000- 
75,000 cells per 6-well dish in media containing low glucose DMEM, 1-glutamine, 
penicillin/streptomycin and 20% FBS. The cells were grown in a 5% C02 incubator at 37°C 
for 3-5 days until approximately 80% confluence or 500,000 cells per well. 
Transfection of cells with ACes 

One million IdUrd-labelled ACes were complexed with 2, 5 or 10 ul of Superfect 
(Qiagen) or Lipofectamine Plus (Life Technologies; Gibco) as follows. Complexing with 
Superfect was conducted for 1 0 minutes at room temperature. For complexing with 
Lipofectamine Plus, the indicated amounts of PLUS reagent were added to 1 million ACes 
and complexed at room temperature for 15 minutes. Next the indicated amounts of 
Lipofectamine were added into 200 ul of low glucose DMEM (no FBS) and combined with 
the ACes/PLUS complex for 15 minutes at room temperature. The complexed ACes were 
then added dropwise to the cells in 600 ul media (final volume of approximately 1.4 ml). 
After 3 hrs at 37°C in a 5% C02 incubator, a total volume of 3 ml of culture media (low 
glucose DMEM, 1-glutarnine, penicillin/streptomycin and 20% FBS) was added. After 24-48 
hrs, the cells were trypsinized to form a single cell suspension, centrifuged to remove the 
supernatant and then fixed in cold 70% ethanol for a minimum of one hour. An aliquot of the 
fixed cells was saved for microscopic analysis. 

FITC-Conjugated antibody labelling of ACes 

Following transfection, the ACes were labelled with FITC-conjugated antibody that 
specifically binds to BrdU- or IdUrd-labelled nucleic acids and the cells were analyzed by 
FACs for FITC fluorescence and microscopic staining. Fixed cells were denatured in 2N 
HC1 and 0.5% Triton-X for 30 minutes at room temperature. After denaturation, the cells 
were neutralized by a series of wash steps at 4°C. To minimize background staining, the 
sample was resuspended in PBS and 4% FBS or BSA and 0.1% Triton-X (blocking buffer) 
for a minimum of 15 minutes. The cells were then pelleted and exposed for 2 hours at room 
temperature to FITC-conjugated antibody. After the cells were washed with blocking buffer, 
the sample was ready for flow cytometry analysis. Samples for microscopic analysis were 
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dried on slides and the above staining protocol was followed, except that BrdU/IdUrd 
antibody was diluted 1/5 and exposed to cells for 24 hours. 
Results 

The delivery of intact ACes was detected within 24 to 48 hours post transfection. The 
number of cells recovered at 24 hours post-transfection was determined by counting an 
aliquot using a Coulter Counter. To deterniine the control plating efficiency of a recipient 
cell line or plating efficiency of the transfected cells, the cells were plated at 1000-10,000 
cells per 10 cm petri dish in growth medium and left stationary in a 5% C02 incubator at 
37°Cfor 10 days. At that point, the number of viable colonies was determined. The 
normalized plating efficiency was calculated as described herein. 

When Superfect was used as a delivery agent, the percent delivery into fibroblast-like 
synoviocytes as determined by flow cytometry ranged from -24% to -66.3%. The 
normalized % plating efficiency was -36% when 2 ul of Superfect was used and -16% when 
5 ul of Superfect was used. Higher doses of Superfect were associated with toxicity and 
multiple ACes per cell as compared to lower doses. When Lipofectamine Plus was used as a 
delivery agent, the percent delivery into fibroblast-like synoviocytes as determined by flow 
cytometry ranged from -1 1% to -27% with percent delivery increasing with increasing doses 
of agent. 

L8 and rat skin fibroblasts (RSF) that had been transfected with ACes were grown 
under hygromycin B selection and analyzed for lacZ expression. While in this example, a 
hygromycin selection gene was included in the ACes, there are numerous other selectable 
marker genes that may be used in connection with the transfer of heterologous nucleic acids 
into cells when it is desirable to include such genes. Such selection systems are known to 
those of skill in the art. A choice of selectable marker gene can, for instance, take into 
account the level of toxicity of the selection agent on the host cell for transfection. 
Identification of an appropriate selectable marker gene is routine employing the guidance 
provided herein. 

Clones of L8 cells and RSF cells expressing lacZ were identified. These results 
demonstrate that IdUrd-labelled ACes can be delivered efficiently into primary cells as well 
as cell lines and that transgenes contained in the ACes are expressed in the transfected cells. 
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EXAMPLE 8 
Ex vivo transfer of reporter genes into rat joints 

To examine transfer of a heterologous gene into an in vivo environment and 
expression of the gene in vivo, L8 cells transfected with ACes as described above were 
injected into the ankle joint of rats with adjuvant-induced arthritis. On day 0, adjuvant 
induction of arthritis was performed on Lewis rats. Methods for adjuvant induction of 
arthritis in animal models are known in the ait [see, e.g., Kong et al (1999) Nature 4023:304- 
309]. In one exemplary protocol for adjuvant induction of arthritis, Lewis rats are immunized 
at the base of the tail with 1 mg Mycobacterium tuberculosis H37 RA (Difco, Detroit, 
Michigan) in 0.1 ml mineral oil on day 0. Paw swelling typically begins around day 10. 

On day 12, intra-articular injection of transfected L8 cells (-0.7 x 106 cells) or 
untransfected control cells into the right ankle joint was performed. On day 14, the rats were 
sacrificed in order to analyze the joints for the presence of transplanted transfected L8 cells. 

Different tissues of the sacrificed rats were examined by RT-PCR analysis for the 
presence of lacZ mRNA. Total mRNA was extracted from the tissues and RT-PCR was 
performed using primers specific for the lacZ gene. The an amplification product was 
detected only in the synovium, and not in the other tissues (liver, kidney, heart, spleen and 
lung). Synovium from the sacrificed rats was also analyzed by in situ enzymatic staining X- 
gal staining for p-gal activity. After snap freezing of the synovium, 8 um and 20 um sections 
were cut, counter-stained with Mayer 's hematoxylin, and analyzed for blue staining of the 
cells. Staining was detected in synovium injected with L8 cells that had been transfected 
with ACes but not in synovium injected with untransfected cells. These results demonstrate 
successful ex vivo gene transfer in a rat adjuvant arthritis model using ACes containing a 
marker gene and thus the feasibility of treating arthritis and other connective tissue diseases 
using ACes as non-viral vectors for gene therapy. 

EXAMPLE 9 

A flow cytometry technique for measuring delivery of artificial chromosomes 

Production cells lines (see Example 1) were grown in MEM medium (Gibco BRL) 
with 10% fetal calf serum (Can Sera, Rexdale ON) with 0.168 bg/ml hygromycin B 
(Calbiochem, San Diego, CA). Iododeoxyuridine or Bromodeoxyuridine was added directly 
to culture medium of the production cell line (CHO E42019) in the exponential phase of 
growth. Stock Iododeoxyuridine was made in tris base pH 10. Bromodeoxyuridine stocks in 
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PBS. Final concentrations of 0.05-1 pM for continuous label of 20-24 hours of 5-50 pM with 
15 minute pulse. After 24 hours, exponentially growing cells were blocked in mitosis with 
colchicine (1 .0 bg/ml for 7 hours before harvest. Chromosomes were then isolated and 
stained with Hoechst 33258 (2.5 bg/ml) and chromomycin A3 (50 bg/ml). Purification of 
artificial chromosomes was performed using a FACS Vantage flow cytometer (Becton 
Dickinson Immunocytometry systems, San Jose, CA). Chromomycin A3 was excited with the 
primary laser set at 457 nm, with emission detected using 475 nm long pass filter. Hoechst 
was excited by the secondary UV laser and emission detected using a 420/44 nm band-pass 
filter. Both lasers had an output of 1 50 mW. Bivariate distribution showing cell karyotype 
was accumulated from each sort. ACes were gated from other chromosomes and sorted. 
Condensing agents (hexylene glycol, spermine, and spermidine) were added to the sheath 
buffer to maintain condensed intact chromosome after sorting. IdU labeling index of sorted 
chromosomes was determined microscopically. Aliquot (2-10 ul) of sorted chromosomes 
was fixed in 0.2% formaldehyde solution for 5 minutes before being dried on clean 
microscopic slide. Microscope sample was fixed with 70% ethanol. Air-dried slide was 
denatured in coplin jar with 2N HC1 for 30 minutes at room temperature and washed 2-3 
times with PBS. Non specific binding was blocked with PBS and 4% BSA or serum for 
minimum of 10 minutes. A 1/5 dilution of FITC conjugated IdU/BrdU antibody (Becton 
Dickinson) with a final volume of 60-100 ul was applied to slide. Plastic strips, Durra seal 
(Diversified Biotech, Boston, MA) were overlaid on slides, and slides were kept in dark at 
4%C in humidified covered box for 8-24 hours. DAPI (Sigma) 1 bg/ml in Vectorshield was 
used as counterstain. Fluorescence was detected using Zeiss axioplan 2 microscope equipped 
for epiflorescence. Minimum of 100 chromosomes was scored for determining % labeled. 
Unlabeled chromosomes were used as negative control. 

The day before the transfection, trypsinize V79-4 (Chinese Hamster Lung fibroblast) 
cells and plate at 250,000 into a 6 well petri dish in 4 mis DMEM (Dulbecco «s Modified 
Eagle Medium, Life Technologies) and 10% FBS (Can Sera Rexdale ON). The protocol was 
modified for use with LM (tk-) cell line by plating 500,000 cells. Lipid or dendrimer 
reagentwas added to 1 X106 ACes sorted in -800 ul sort buffer. Exemplary protocol 
variations are set forth in Table 1 . Chromosome and transfection agents were mixed gently. 
Complexes added to cells drop-wise and plate swirled to mix. Plates were kept at 37°C in a 
5% C02 incubator for specified transfection time. The volume in a well was then made up to 
4-5 ml with DMEM and 10% FBS. Recipient cells left for 24 hours at 37°C in a 5% C02 
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incubator. Trypsinize transfected cells. Samples to be analyzed for IdU labeled chromosome 
delivery are fixed in cold 70% ethanol and stored at -20°C, to be ready for IdU antibody 
staining. Samples to be grown for colony selection are counted and then transferred to 10-cm 
dishes at densities of 10,000 and 100,000 cells in duplicate with remaining cells put in a 1 5 
cm dish. After 24 hours, selection medium containing of DMEM and 10% FBS with 0.7 
mg/ml hygromycin B, # 400051 (Calbiochem San Diego, CA) is added. Selection medium is 
changed every 2-3 days. This concentration of hygromycin B kills the wild type cells after 
selection for 7 days. At 10-14 days colonies were expanded and then screened by FISH for 
intact chromosome transfer and assayed for beta galactosidase expression. 



Table 1: Delivery Transfection Protocols 



Agent 


Dilution 


Pre 
treatment 
of ACes 


Complexing 
time 
(minutes) 


Added to 
complexes 


Medium 
(ml) added 

to wells 
before 
complexes 


Transfection 
time (hours) 


CLONFECTIN 


2-8 ug in 

NaCl-HEPES 

buffer 




20 


1.8 ml of 
serum free 
medium 




4 


CYTOFECTENE 






10-20 


200 \il of 
50% FBS 
plus DMEM 




24 


ENHANCER + 
EFFECTENE (1:5 
ratio) 




Enhancer 
5 minutes 


10 




1.2 


J 


EU-FECTIN-1 to 
11 






5-10 






6 


FUGENE 6 


0.5-6 ulto 
final volume 
of 100 ulin 
serum free 
medium 




15-45 






4 


GENEPORTER 2 


2.5 ul added 
to 150 ul of 
serum free 
medium 




2-10 






2-4 


LIPOFECTAMIN 
E 






15 






3 


LIPOFECTAMIN 
E2000 






20 




2.5 


5 


METAFECTENE 


diluted into 
60 ul serum 
free medium 




15-45 




0.8 


6 


PLUS + 

LIPOFECTAMIN 
E(l:l and 3:2 
ratio) 

SUPERFECT 




PLUS and 
200 ul of 

DMEM 
for 15 

minutes 


15 
10 




0.6 


3 
3 
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IDU ANTIBODY LABELING 

A standard BrdU staining flow cytometry protocol (Gratzer et al. Cytometry 
(1981);6:385-393) was used except with some modifications at the neutralization step, the 
presence of detergent during denaturation and the composition of blocking buffer. Between 
each step samples are centrifuged at 300 g for 7-10 minutes and supernatant removed. 
Samples of 1-2 million cells are fixed in 70% cold ethanol. Cells are then denatured in 1-2 
ml of 2N HCL plus 0.'5% triton X for 30 minutes at room temperature. Sample undergoes 3- 
4 washes with cold DMEM until indictor is neutral. Final wash with cold DMEM plus 5% 
FBS. Blocking/permeabilization buffer containing PBS, 0.1% triton X and 4%. FBS is 
added for 10-15 minutes before pelleting sample by centrifugation. Add 20 ul of IdU/BrdU 
FITC conjugated B44 clone antibody (Becton Dickinson Immunocytometry Systems, San 
Jose, CA) to pellet and leave for 2 hours at room temperature in the dark with agitation every 
30 minutes. Wash cells with block/permeabilization buffer and resuspend in PBS for flow 
analysis. 

FLOW CYTOMETRY DETECTION OF FLUORESCENT IDUrdD LABELED 

ACes 

Percentage of transfected cells containing IdU labeled ACes was determined using a 
flow cytometry with an argon laser turned to 488 nm at 400 mW. FITC fluorescence was 
collected through a standard FITC 530/30-nm band pass filter. Cell populations were gated 
on the basis of side scatter versus forward scatter to exclude debris and doublets. Data was 
accumulated (15,000 events) to form bivariate channel distribution showing forward scatter 
versus green fluorescence (IdU-FITC). The fluorescence level at which cells were 
determined to be positive was established by visual inspection of the histogram of negative 
control cells, such that approximately 1% appeared in the positive region. 
Results: 

The transfection delivery results of IdU labeled ACes are set forth in Table 2. 
Table 2 



COMPOUND 



CLONFECTIN 



CYTOFECTENE 



ENHANCER + EFFECTENE (1 .5) 



DOSE 



Microliters agent added per 1 



8 



EU-FECTIN-1 



EU-FECTIN-2 



1.6,10 



10 



DELIVERY 



% IdU positive (24 hours) 



0.61 



14.67 



17.08 



4.57 



0.14 
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EU-FECTIN-3 


10 


0.69 


EU-FECTIN-4 


10 


0.24 


EU-FECTIN-5 


10 


0.41 


EU-FECTIN-6 


10 


0.46 


EU-FECTIN-7 


10 


1.21 


EU-FECTIN-8 


10 


1.58 


EU-FECTIN-9 


10 


0.6 


EU-FECTIN-IO 


10 


0.77 


EU-FECTIN-ll 


5 


1 


FUGENE 


8 


0.49 


GENEPORTER 


5 


22.12 




LIPOFECTAMINE 


26 


17.81 


LIPOFECTAMINE 2000 


30 


10.96 


PLUS + LIPOFECTAMINE (l : l) 


12. 12 


12.2 


PLUS + LIPOFECTAMINE (3:2) 


24.16 


26.97 


METAFECTENE 


10 


14.14 


SUPERFECT 


2 


27.67 





EXAMPLE 10 

Delivery of ACes into LM(tk-) and Hff cells by cell synchronization 

A mammalian (murine) derived 60Mb ACes artificial chromosome was used for all 
transfections. The ACes artificial chromosome consists primarily of mouse major satellite 
DNA, interspersed with blocks of 'payload ' genetic material, including a reporter gene 
(lacZ) and a hygromycin B selectable marker gene. The ACes were prepared and purified as 
previously described herein and delivered into synchronized, immortalized murine LM(tk-) 
and primary human foreskin fibroblast (Hff) cells. Two common cell cycle arrest agents, 
thymidine and nocodazole, were chosen for cell synchronization. Thymidine incorporation 
accumulates cells in G0-G1 phase, and nocadozole acts by arresting cells in G2-M. Removal 
of the arresting agents allows the cells to continue through the cell cycle in a synchronous 
manner for approximately 24 hours. Two cationic transfection agents were used to deliver 
the ACE systems into cells, Superfect (Qiagen, Inc., Mississauga, Ontario) and 
LipofectAMINE PLUS (Gibco BRL). Transfection with these agents was carried out 
according to manufacturer's protocols. 

Nocodazole Cell Synchronization 

Optimization of Nocodozole Cell Synchronization in LM(tk-) Cells 
To determine the optimum blocking concentration of nocodazole, nocodazole 
concentrations were titrated against LM(tk-) cells to generate a dose response. Timing of the 
optimal blocking concentration (100 ng/ml) was then assessed over 0-16 hours using flow 
cytometry. To determine the cell cycle kinetics of LM(tk-) cell populations at different time 
points, cells were stained with propidium iodide (PI) at a concentration of 20 ng/ml for 1x106 
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cells and analyzed with a FACS Vantage SE (Becton Dickinson Immunocytometry Systems, 
San Jose, CA) equiped with an argon laser tuned to 488 nm at 200 mW. Whole intact cells 
were gated on forward scatter (FSC, representing cell size) versus side scatter (SSC, 
representing internal granularity) and PI emission was detected with a 630/22 filter. A list 
mode file was generated with a rninimum of 20,000 whole intact cells analyzed. The 
proportion of cells in G0-G1, S and G2-M were documented for each time point. Optimum 
cell-cycle arrest at the G2/M boundary was achieved by incubating cells with 100 ng/ml 
nocodazole in serum containing DMEM medium for 6 hours. After 6 hours of nocodazole 
exposure at 100 ng/ml, approximately 20% of cells were in S phase and 50% of cells were at 
G2/M. Cell cycle kinetics were again monitored by flow cytometry upon release of the 
nocodazole block and at frequent intervals over a 16 hour period. 

Optimizing the Time Point for Transfection in LM(tk-) cells 
Cells were plated at 7.5x105 cells per well of a 6-well plate 24 hours prior to 
nocodazole treatment. The cells were incubated with nocodazole for 6 hours. After the 6 
hour incubation, wells were hosed with medium to dislodge the suspension fraction of cells. 
Nocodazole containing medium was removed and the wells were washed with fresh medium. 
The wash medium was aspirated and fresh medium was replaced in the wells. To determine 
the optimum time point for transfection, different cell samples were transfected at different 
time points after release of the nocodazole block. The time points were 0, 2, 4, 6, 8, 12, and 
16 hours after release of the nocodazole block. Following transfer, the cells were plated and 
transfectants identified using hygromycin selection. Highest colony formations were 
achieved when transfection occurred 2-4 hours post nocodazole block. At these time points, 
cell population distribution consisted of approximately 10% of cells in G0/G1, 20% in S, and 
70% in G2/M. Transfectants were analyzed using FISH to determine the quality of the 
transferred material. Randomly selected drug resistant colonies were analyzed and uie 
number of colonies where at least 50% of the cells had an intact ACes were scored. Selected 
colonies derived from nocodazole synchronized transfections had a larger percentage of cells 
that maintained intact ACes compared with asynchronous control colonies. With both 
LipofectAMINE PLUS or Superfect, the percentage of colonies meeting this criteria was 4% 
(2/49) for asynchronous control colonies and 43% (20/47) for nocodazole synchronized 
colonies, p-galactosidase staining was used to confirm expression of the reporter gene. 
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Delivery of ACes into LM(tk-) cells by cell synchronization. 

Cells were synchronized by nocodazole treatment (lOOng/ml final concentration) for 6 
hours prior to transfection. The mitotic cells are gently removed by rinsing the plate with 
growth medium. Adherent cells were transfected at 2-hour intervals (approximately 1 million 
cells per interval) with Superfect (lOul) over an 8-hour time course. Twenty-four hours post- 
transfection the cells are counted (Coulter counter) and a small fraction are plated for a 
modified toxicity assay. The remaining cells were plated separately in growth medium and 
24 hours later were placed in selection medium (growth medium with lmg/ml hygromycin 
B). The selection medium was changed every 2-3 days until colonies formed. 
LipofectAMINE PLUS concentrations used were 12ul for both the LipofectAMINE and 
PLUS reagent. Transfection conditions were otherwise identical and selection of LM(tk-) 
cells was acomplished as described above. 

Delivery of ACes into Hff cells by cell synchronization. 
The protocol for Hff cells differs from that used for LM(tk-) cells in the following 
details: nocodazole concentration was 45ng/ml for an 8 hour exposure; 4ul Superfect was 
used; Transfection were carried out an half hour intervals over a two hour time frame; colony 
selection was not done as the Hff cells are hygromycin B sensitive. Cells were transfected 
with either 4ul Superfect or lOul LipofectAMINE PLUS (lOul for each component 
LipofectAMINE and PLUS; Invitrogen). Transfections were carried out every half hour over 
a 2-hour timeframe (cells returned to logarithmically growing state after this short time). The 
Hff cells are sensitive to hygromycin B therefore transfections were carried out using the RFP 
ACes and expression was monitored by flow cytometry. 
Thymidine Cell Synchronization 

Optimization of Thymidine Cell Synchronization in LM(tk-) Cells 
To determine the optimum parameters for cell cycle synchronization, thymidine was 
administered to cell samples in a dose range of 1-20 mM. Samples were analyzed by flow 
cytometry to determine the cell cycle kinetics of LM(tk-) cell populations at frequent 
intervals over a 1 6 hour time period. Optimum synchronization of cells in G0/G1 was 
achieved by incubating LM(tk-) cells with 10 mM thymidine in serum containing DMEM 
medium for 16 hours. The thymidine block was released by washing the cells twice in 
serum-free medium and replacing the medium with complete medium containing 24 uM 2 '- 
deoxycytidine 5 'mono-phosophate (dCMP; Sigma D-7750, St. Louis, MO). Cell cycle 
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kinetics were again monitored by flow cytometry upon release of the thymidine block and at 
frequent intervals over a 16 hour period. 

Optimizing the Time Point for Transfection in LM(tk-) Cells 
Cells were plated at 7.5x105 cells per well of a 6-weU plate 24 hours prior to 
thymidine treatment. The ceUs were incubated with medium containing 10 mM thymidine 
for 16 hours. After the 16 hour incubation, the thymidine block was released by washing the 
cells twice in serum-free medium and replacing the medium with 24 uM deoxycytidine 5 
'mono-phosophate (dCMP) supplemented complete medium. To determine the optimum 
time point for transfection, different cell samples were transfected at different time points 
after release of the thymidine block. The time points were 0, 2, 4, and 6 hours after release of 
the thymidine block. Following transfer, the cells were plated and transfectants identified 
using hygromycin selection. Transfectants were analyzed using FISH to determine ACE 
system integrity and p- galactosidase staining to confirm expression of the reporter gene. 
Optimal transfection was achieved at 4-5 hours post thymidine release. 
Hff cells 

A protocol similar to that used with the LM(tk-) cells was employed with the 
exception of an alteration in the dose of thymidine (5mM). 
Results 

Studies indicate successful achievement of reversible arrest of the LM(tk-) and Hff 
cells with both nocodazole and thymidine. Studies further show that synchronization and 
release of cells at specific points in the cell cycle increases the efficiency of ACes 
transfection into LM(tk-) and Hff cells. Using nocodazole or thymidine with either of two 
transfection agents (Superfect of LipofectAMINE PLUS), transfection efficiencies were 
enhanced 2.5-6 fold above control transfections on asynchronous cell populations. 
Transfection efficiency was strongly correlated with the fraction of cells in G2/M (R2 = 
0.827), regardless of which blocking agent was used suggesting that optimal transfection is 
achieved when the majority of the cells are in the G2/M phase of the cell cycle, regardless of 
where the cell population is originally arrested. Qualitative assessments of the chromosomes 
also revealed that delivery of ACes to cells during the optimized transfection point led to 
higher proportions of intact artificial chromosomes compared to asynchronous controls. 

Since modifications will be apparent to those of skill in this art, it is intended that this 
invention be limited only by the scope of the appended claims. 
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CLAIMS: 

1 . A method for introducing a large nucleic acid molecule into a cell, comprising: 
obtaining a cell that is in a pre-selected phase; and 

contacting the cell with the large nucleic acid molecule, whereby the large nucleic 
acid molecule is delivered into the cell. 

2. The method of claim 1 , wherein the nucleic acid molecule is greater than about 0.6 
megabase. 

3 . The method of claim 1 , wherein the nucleic acid molecule is greater than about 1 
megabase. 

4. The method of claim 1 , wherein the nucleic acid molecule is greater than about 5 
megabases. 

5. The method of claim 1, wherein the nucleic acid molecule is a natural chromosome, 
an artificial chromosome, or a fragment of a chromosome that is greater than about 0.6 
megabase or naked DNA that is greater than about 0.6 megabase. 

6. The method of claim 1 , wherein the nucleic acid molecule is an artificial 
chromosome. 

7. The method of claim 1, wherein the nucleic acid molecule is an artificial chromosome 
expression system (ACes). 

8. The method of claim 1, wherein the nucleic acid molecule and/or the cell is exposed 
to a delivery agent. 

9. The method of claim 8, wherein the delivery agent comprises a cationic compound. 

10. The method of claim 9, wherein the cationic compound is selected from the group 
consisting of a cationic lipid, a cationic polymer, a mixture of catiomc lipids, a mixture of 
catiomc polymers, a mixture of a cationic lipid and a cationic polymer, a mixture of a catiomc 
lipid and a neutral lipid, polycationic lipids, non-liposomal forming lipids, activated 
dendrimers, and a pyridinium chloride surfactant. 

11. The method of claim 8, wherein the delivery agent is a composition that comprises 
one or more cationic compounds, wherein the compound is selected from the group 
consisting of 2,3-dioleyloxy-N-[2(spermme-carboxanndo)emyl]-N,N-dimethyl-l- 
propanaminiumtrifluoroacetate (DOSPA), C52H106N6O4.4CF3CO2H, 
C 88 Hi78N804S2.4CF 3 C0 2 H, C^s^OsP.CFsCOzH, C50H103N7O3.4CF3CO2H, 
C 55 H,i6N80 2 .6CF 3 C02H, C49H102N6O3.4CF3CO2H, C44H89Ns03.2CF 3 C0 2 H, 
C 10 oH206Ni 2 04S2.8CF3C0 2 H, C4iH 7 8N0 8 P) C162H330N22O9.I3CF3CO2H, 
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C 43 H 8 8N40 2 .2CF3C0 2 H, C43H 8 8N403.2CF 3 C0 2 H, and (l-methyl-4-(l-octadec-9-enyl- 
nonadec-10-enylenyl) pyridinium chloride. 

12. The method of claim 1 , wherein: 

the nucleic acid molecule is exposed to an agent that increases contact between the 
nucleic acid molecule and the cell; and 

the cell is exposed to an agent that enhances permeability of the cell. 

13 . The method of claim 12, wherein the exposure of the cell to an agent that enhances 
permeability comprises applying ultrasound or electrical energy to the cell. 

14. The method of claim 1, wherein the cell is selected from the group consisting of a 
nuclear transfer donor cell, a stem cell, a primary cell, a cell from an immortalized cell line, 
an embyronic cell, a tumor cell, atransformed cell and a cell capable of the generation of a 
specific organ. 

15. The method of claim 1, wherein the cell is selected from the group consisting of a 
primary cell, an immortalized cell, an embryonic cell, a stem cell, a transformed cell and a 
tumor cell. 

16. The method of claim 1, wherein the cell is selected from the group consisting of a nuclear 
transfer donor cell, a stem cell, and a cell capable of the generation of a specific organ. 

17. The method of claim 1 , wherein the cell is a mammalian cell. 

18. The method of claim 1 , wherein the cell is a rodent cell or a human cell. 

1 9. The method of claim 1 , wherein the cell is a fibroblast. 

20. The method of claim 1 , wherein the cell is a synoviocyte. 

21. The method of claim 20, wherein the cell is a fibroblast-like synoviocyte. 

22. The method of claim 1, wherein the pre-selected phase of the cell is determined by a 

method comprising: 

introducing a large nucleic acid into cells at different phases; 

determining and comparing the efficiency of delivery and/or of transfection of the 
nucleic acid into the cells at different phases; and 

selecting a phase at which the efficiency of delivery and/or of transfection is increased 
relative to efficiency of delivery and/or transfection at other phases. 

23 . The method of claim 1 , wherein the cell that is in a pre-selected phase is obtained by a 

method comprising: 

exposing one or more cells to a cell cycle arrest agent; 

exposing the one or more cells to conditions that permit cell cycling; 
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selecting a cell that is in the pre-selected phase. 

24. The method of claim 23, wherein the arrest agent is an anti-microtubule agent. 

25. The method of claim 23, wherein the arrest agent is nocodazole or thymidine. 

26. The method of claim 1 or 23, wherein the pre-selected phase is G2/M. 

27. The method of claim 23, wherein exposure of the one or more cells to an arrest agent 
results in arrest of the one or more cells in mitosis or G0/G1. 

28. The method of claim 1, wherein the nuclear membrane of the cell is absent. 

29. The method of claim 1 , wherein a plurality of cells in a pre-selected phase is obtained. 

30. The method of claim 29, wherein the plurality of cells is in a synchronous population 
of cells. 

31 . The method of claim 30, wherein the efficiency of transfection of the large nucleic 
acid molecule to cells in a synchronous population of cells is at least 1 .5, at least 2, at least 
2.5, at least 3, at least 3.5, at least 4, at least 4.5, at least 5, at least 5.5, at least 6, at least 6.5, 
at least 7, at least 7.5, at least 8, at least 8.5, at least 9, at least 9.5 or at least 10-fold greater 
than the efficiency of transfection of the large nucleic acid molecule to cells in an 
asynchronous population of cells. 

32. The method of claim 30, wherein the large nucleic acid molecule is a chromosome or 
artificial chromosome and the percentage of cells comprising an intact delivered chromosome 
after contact of a synchronous population of cells with the chromosome or artificial 
chromosome is greater than.the percentage of cells comprising an intact delivered 
chromosome after contact of an asynchronous population of cells with the chromosome or 
artificial chromosome. 

33 . The method of claim 32 wherein the artificial chromosome is an ACes. 

34. A method for introducing a chromosome or functional fragment thereof into a cell, 
comprising contacting the cell with a chromosome or fragment thereof that is in the same or 
similar phase as the cell, whereby the chromosome or functional fragment thereof is delivered 
into the cell. 

35. The method of claim 34 wherein the chromosome is an artificial chromosome. 

36. The method of claim 35, wherein the chromosome is an ACes. 

37. A method for ex vivo gene therapy, comprising: 
obtaining a cell that is in a pre-selected phase; 

contacting the cell with a nucleic acid molecule, whereby the nucleic acid molecule is 
delivered into the cell; and 
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introducing the cell into a subject. 

38. The method of claim 37, wherein the nucleic acid molecule is a large nucleic acid 
molecule. 

39. The method of claim 37, wherein the nucleic acid molecule is greater than about 0.6 
megabase. 

40. The method of claim 37, wherein the nucleic acid molecule is greater than about 1 
megabase. 

41 . The method of claim 37, wherein the nucleic acid molecule is greater than about 5 
megabases. 

42. The method of claim 37, wherein the nucleic acid molecule is a natural chromosome, 
an artificial chromosome, or a fragment of a chromosome that is greater than about 0.6 
megabase or naked DNA that is greater than about 0.6 megabase. 

43 . The method of claim 3 7, wherein the nucleic acid molecule is an artificial 
chromosome. 

44. The method of claim 37, wherein the nucleic acid molecule is an artificial 
chromosome expression system (ACes). 

45. The method of claim 37, wherein the nucleic acid molecule and/or the cell is exposed 
to a delivery agent. 

46. The method of claim 45, wherein the delivery agent comprises a cationic compound. 

47. The method of claim 46, wherein the cationic compound is selected from the group 
consisting of a cationic lipid, a cationic polymer, a mixture of cationic lipids, a mixture of 
cationic polymers, a mixture of a cationic lipid and a cationic polymer, a mixture of a cationic 
lipid and a neutral lipid, polycationic lipids, non-liposomal forming lipids, activated 
dendrimers, and a pyridinium chloride surfactant. 

48. The method of claim 45, wherein the delivery agent is a composition that comprises 
one or more cationic compounds, wherein the compound is selected from the group 
cor^istmgof2,3-dioleyloxy-N42(spenrnne-carboxamido)emyl]-N,N-dimethyl-l- 

propanaminiumtrifluoroacetate (DOSPA), C52H106N6O4.4CF3CO2H, 
C 88 H 178 N80 4 S 2 .4CF3C0 2 H, GwHwNOaP.CFsCOjH, CS0H103N7O3.4CF3CO2H, 
C 5 5Hn6N80 2 .6CF3C0 2 H, C 49 Hi02N 6 O3.4CF 3 CO 2 H, OhH^NsC^CFsCC^H, 
C 10 oH 2 o6N 12 0 4 S2.8CF3C0 2 H, C 41 H 78 N0 8 P) C 162 H 3 3oN220 9 .13CF 3 C0 2 H, 
C 43 H 88 N 4 0 2 .2CF 3 C0 2 H, C 4 3H 88 N 4 0 3 .2CF 3 C0 2 H, and (l-methyl-4-(l-octadec-9-enyl- 
nonadec-10-enylenyl) pyridinium chloride. 
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49. The method of claim 37, wherein: 

the nucleic acid molecule is exposed to an agent that increases contact between the 
nucleic acid molecule and the cell; and 

the cell is exposed to an agent that enhances permeability of the cell. 

50. The method of claim 49, wherein the exposure of the cell to an agent that enhances 
permeability comprises applying ultrasound or electrical energy to the cell. 

51. The method of claim 37, wherein the cell is a mammalian cell. 

52. The method of claim 37, wherein the cell is a human cell. 

53. The method of claim 37, wherein the cell is a synoviocyte. 

54. The method of claim 37, wherein the cell is a fibroblast-lilce synoviocyte. 

55. The method of claim 37, wherein the pre-selected phase is determined by a method 
comprising: 

introducing nucleic acid into cells at different phases; 

deterrnining and comparing the efficiency of delivery and/or transfection of the 
nucleic acid into the cells at different phases; and 

selecting a phase at which the efficiency of delivery and/or transfection is increased 
relative to efficiency of delivery and/or transfection at other phases. 

56. The method of claim 37, wherein the cell that is in a pre-selected phase is obtained by 

a method comprising: 

exposing one or more cells to a cell cycle arrest agent; 

exposing the one or more cells to conditions that permit cell cycling; 

selecting a cell that is in the pre-selected phase. 

57. The method of claim 56, wherein the arrest agent is an anti-microtubule compound. 

58. The method of claim 56, wherein the arrest agent is nocodazole or thymidine. 

59. The method of claim 37 or 56, wherein the pre-selected phase cycle is G2/M. 

60. The method of claim 56, wherein exposure of the one or more cells to an arrest agent 
results in arrest of the one or more cells in mitosis or G0/G1. 

61 . The method of claim 37, wherein the nuclear membrane of the cell is absent. 

62. The method of claim 37, wherein a plurality of cells in a pre-selected phase is 
obtained and the plurality of cells is contacted with one or more nucleic acid molecules, 
whereby the nucleic acid molecule(s) is delivered into one or more cells, and one or more 
cells are introduced into the subject. 
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63. The method of claim 62, wherein the plurality of cells is in a synchronous population 
of cells. 

64. The method of claim 63, wherein the efficiency of transfection of the nucleic acid 
molecule to cells in a synchronous population of cells is at least 1.5, at least 2, at least 2.5, at 

5 least 3, at least 3.5, at least 4, at least 4.5, at least 5, at least 5.5, at least 6, at least 6.5, at least 
7, at least 7.5, at least 8, at least 8.5, at least 9, at least 9.5 or at least 10-fold greater than the 
efficiency of delivery of me nucleic acid molecule to cells in an asynchronous population of 
cells. 

65. The method of claim 63, wherein the nucleic acid molecule is a chromosome or 

10 artificial chromosome and the percentage of cells comprising an intact delivered chromosome 
after contact of a synchronous population of cells with the chromosome or artificial 
chromosome is greater than the percentage of cells comprising an intact delivered 
chromosome after contact of an asynchronous population of cells with the chromosome or 

artificial chromosome. 
1 5 66. The method of claim 65 wherein the artificial chromosome is an ACes. 

67. A method for ex vivo gene therapy, comprising: 

contacting a cell with a chromosome or fragment thereof that is in the same or similar 
phase as the cell, whereby the chromosome or functional fragment thereof is delivered into 

the cell; and 
20 introducing the cell into a subject. 

68. The method of claim 67 wherein the chromosome is an artificial chromosome. 

69. The method of claim 68, wherein the chromosome is an ACes. 

70. A kit for delivering large nucleic acids into cells, comprising: 
a delivery agent; 

25 a cell cycle arrest agent; and 

optionally instructions for delivering large nucleic acids into cells. 

71 . A method for selecting a host cell for receiving a large nucleic acid molecule, 
comprising: 

introducing a large nucleic acid molecule into cells at different phases; 
30 detenruning and comparing the efficiency of delivery and/or transfection of the 

.nucleic acid into the cells at different phases; and 

selecting a cell that is in a phase that provides for increased efficiency of delivery 
and/or transfection relative to the efficiency of delivery and/or transfection at other phases. 
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72. The method of claim 7 1 , wherein the large nucleic acid molecule is labeled. 

73 . The method of claim 72, wherein the labelled cells are detected by flow cytometry, 
fluorimetry, cell imaging or fluorescence spectroscopy. 

74. The method of claim 73, wherein the labelled cells are detected by flow cytometry. 

75. The method of claim 73, wherein the label is iododeoxyuridine (IdU or IdUrd) or 
bromodeoxyuridine (BrdU). 

76. The method of claim 73, wherein the large nucleic acid molecule is a chromosome or 
functional fragment thereof. 

77. The method of claim 73, wherein the large nucleic acid molecule is an artificial 
chromosome. 

78. The method of claim 77, wherein the artificial chromosome is an ACes. 

79. The method of any of claims 1-33, 37-47 or 56, wherein the pre-selected phase of the 
cell is selected from the group consisting of Gl, S, G2, G2/M, M and any of the stages of the 
M phase. 

80. The method of claim 1, 23, 47, or 56, wherein the pre-selected phase of the cell cycle 
is not GO. 

81. The method of any of claims 1-33, 37-66 and 71-78, wherein the large nucleic acid is 
associated with one or more proteins. 

82. The method of any of claims 34-36 and 67-69, wherein the chromosome or functional 
fragment thereof is associated with one or more proteins. 
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SEQUENCE LISTING 

<110> CHROMOS MOLECULAR SYSTEMS, INC. 
DE JONG/ Gary 
PERKINS, Edward 
TELENIUS , Adele 
MAC DONALD, Neil 

<120> METHODS FOR DELIVERING NUCLEIC ACID MOLECULES INTO CELLS AND 
ASSESSMENT THEREOF 

<130> 82828-5 

<140> To Be Assigned 
<141> 2003-04-30 

<150> US 60/377547 
<151> 2002-05-01 

<160> 13 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 1 

cgtccaggag cgcaccatct tctt 

<210> 2 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 2 

atcgcgcttc tcgttggggt cttt 

<210> 3 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 3 

aggactgggt ggcttccaac tcccagacac 

<210> 4 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 4 

agcttctcat tgctgcgcgc caggttcagg 

<210> 5 

<211> 22118 

<212> DNA 

<213> Mus musculus 



30 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



<400> 5 ^ ^ ^ ^ 

gaattcccct atccctaatc cagattggtg gaataacttg gtatagatgt ttgtgcatta 
aaaaccctgt aggatcttca ctctaggtca ctgttcagca ctggaacctg aattgtggcc 
ctgagtgata ggtcctggga catatgcagt tctgcacaga cagacagaca gacagacaga 
cagacagaca gacagacgtt acaaacaaac acgttgagcc gtgtgccaac acacacacaa 
acaccactct ggccataatt attgaggacg ttgatttatt attctgtgtt tgtgagtctg 
tctgtctgtc tgtctgtctg tctgtctgtc tatcaaacca aaagaaacca aacaattatg 
cctqcctgcc tgcctgcctg cctacacaga gaaatgattt cttcaatcaa tctaaaacga 
cctcctaagt ttgccttttt tctctttctt tatctttttc ttttttcttt tcttcttcct 
tccttccttc cttccttcct tccttccttt ctttctttct ttctttcttt cttactttct 
ttctttcctt cttacattta ttcttttcat acatagtttc ttagtgtaag catccctgac 
tgtcttgaag acactttgta ggcctcaatc ctgtaagagc cttcctctgc ttttcaaatg 
ctggcatgaa tgttgtacct cactatgacc agcttagtct tcaagtctga gttactggaa 
aggagttcca agaagactgg ttatattttt catttattat tgcattttaa ttaaaattta 
atttcaccaa aagaatttag actgaccaat tcagagtctg ccgtttaaaa gcataaggaa 
aaagtaggag aaaaacgtga ggctgtctgt ggatggtcga ggctgcttta gggagcctcg 
tcaccattct gcacttgcaa accgggccac tagaacccgg tgaagggaga aaccaaagcg 
acctggaaac aataggtcac atgaaggcca gccacctcca tcttgttgtg cgggagttca 
gttagcagac aagatggctg ccatgcacat gttgtctttc agcttggtga ggtcaaagta 
caaccgagtc acagaacaag gaagtataca cagtgagttc caggtcagcc agagtttaca 
cagagaaacc acatcttgaa aaaaacaaaa aaataaatta aataaatata atttaaaaat 
ttaaaaatag ccgggagtga tggcgcatgt ctttaatccc agctctcttc aggcagagat 1260 
gggaggattt ctgagtttga ggccagcctg gtctgcaaag tgagttccag gacagtcagg 1320 
gctatacaga gaaaccctgt cttgaaaact aaactaaatt aaactaaact aaactaaaaa 
aatataaaat aaaaatttta aagaatttta aaaaactaca gaaatcaaac ataagcccac 
gagatggcaa gtaactgcaa tcatagcaga aatattatac acacacacac acacagactc 
tgtcataaaa tccaatgtgc cttcatgatg atcaaatttc gatagtcagt aatactagaa 
gaatcatatg tctgaaaata aaagccagaa ccttttctgc ttttgttttc ttttgcccca 
agatagggtt tctctcagtg tatccctggc atccctgcct ggaacttcct ttgtaggttt 
ggtagcctca aactcagaga ggtcctctct gcctgcctgc ctgcctgcct gcctgcctgc 
ctgcctgcct gcctgcctca cttcttctgc cacccacaca accgagtcga acctaggatc 
tttatttctt tctctttctc tcttctttct ttctttcttt ctttctttct ttctttcttt 
ctttctttct ttcttattca attagttttc aatgtaagtg tgtgtttgtg ctctatctgc 
tgcctatagg cctgcttgcc aggagagggc aacagaacct aggagaaacc accatgcagc 
tcctgagaat aagtgaaaaa acaacaaaaa aaggaaattc taatcacata gaatgtagat 
atatgccgag gctgtcagag tgctttttaa ggcttagtgt aagtaatgaa aattgttgtg 
tgtcttttat ccaaacacag aagagaggtg gctcggcctg catgtctgtt gtctgcatgt 2\W 
agaccaggct ggccttgaac acattaatct gtctgcctct gcttccctaa tgctgcgatt 22^U 
aaaggcatgt gccaccactg cccggactga tttcttcttt tttttttttt tggaaaatac 2280 
ctttctttct ttttctctct ctctttcttc cttccttcct ttctttctat tctttttttc 2340 
tttctttttt cttttttttt ttttttttaa aatttgccta aggttaaagg tgtgctccac 
aattgcctca gctctgctct aattctcttt aaaaaaaaac aaacaaaaaa aaaaccaaaa 
cagtatgtat gtatgtatat ttagaagaaa tactaatcca ttaataactc ttttttccta 
aaattcatgt cattcttgtt ccacaaagtg agttccagga cttaccagag aaaccctgtg 2b«u 
ttcaaatttc tgtgttcaag gtcaccctgg cttacaaagt gagttccaag tccgataggg 
ctacacagaa aaaccatatc tcagaaaaaa aaaaagttcc aaacacacac acacacacac 
acacacacac acacacacac acacacacac acacacacag cgcgccgcgg cgatgagggg 
aagtcgtgcc taaaataaat atttttctgg ccaaagtgaa agcaaatcac tatgaagagg 
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tactcctaga aaaaataaat acaaacgggc tttttaatca ttccagcact gttttaattt 2880 
aactctgaat ttagtcttgg aaaagggggc gggtgtgggt gagtgagggc gagcgagcag 

acgggcgggc gggogggtga gtggccggcg gcggtggcag cgagcaccag aaaacaacaa 3000 

accccaJgcg gtagagtgtt ttaaaaatga gacctaaatg tggtggaacg gaggtcgccg 3060 

ccaccctcc? cttccactgc ttagatgctc ccttcccctt actgtgctcc °ttcccctaa 3120 

ctgtgcctaa ctgtgcctgt tccctcaccc cgctgattcg ccagcgacgt actttgactt J1BO 

caagaacgat tttgcctgtt ttcaccgctc cctgtcatac tttcgttttt gggtgcccga 3240 

gtctagcccg ttcgctatgt tcgggcggga cgatggggac cgtttgtgcc actcgggaga 3300 

agtggtgggt gggtacgctg ctccgtcgtg cgtgcgtgag tgcoggaacc tgagctcggg 3360 

agaccc?ccg gagagacaga atgagtgagt gaatgtggcg gcgcgtgacg ^tctgtatt 3420 

ggtttgtatg gttgatcgag accattgtcg ggcgacacct agtggtgaca agtttcggga 3480 

acgctccagg cctctcaggt tggtgacaca ggagagggaa gtgootgtgg tgaggcgacc 3540 

agggtgacag gaggccgggc aagcaggcgg gagcgtctcg gagatggtgt cgtgtttaag 3600 

gXggtotct aacaaggagg tcgtacaggg agatggccaa agcagaccga ^tgotgtac 3660 

gcccttttgg gaaaaatgct agggttggtg gcaacgttac taggtcgacc agaaggctta 3720 

agtcctaccc ccccccccct tttttttttt tttcctccag aagccctctc ttgtccccgt 3780 

caccgggggc accgtacatc tgaggccgag aggacgcgat gggcccggct tccaagccgg 3840 

tgtggctcgg cca^ctggcg cttcgggtct tttttttttt tttttttttt ttttcctcca 3900 

gLgccttgt ctgicgctgt caccgggggc gctgtacttc tgaggccgag jggacgcgat 3960 

gggccccggc ttccaagccg gtgtggctcg gccagctgga gcttcgggtc tttttttttt 4020 

?ttttttttt tttttttctc cagaagcctt gtctgtcgct gtcaccgggg gcgctgtact 4080 

tctgaggccg agaggacgcg atgggtcggc ttccaagccg atgtggcggg gccagctgga 414 0 

gcttcgggtt tttttttttc ctccagaagc cctctcttgt ccccgtcacc gggggcgctg 4200 

?acttc?gag gccgagagga cgtgatgggc ccgggttcca SScggatgtc jcc^cag 4260 

ctggagcttt ggatcttttt tttttttttt cctccagaag ccctctcttg tccccgtcac 4320 

cgggggcacc ttacatctga gggcgagagg acgtgatggg tccggcttcc aagccgatgt 4380 

ggcglggcca gctggagctt cgggtttttt ttttttcctc cagaagccct ctcttgtccc 4 44 0 

Sgtcaccggg ggcgctgtac ttctgaggcc gagaggacgt gatgggcccg ggttccaggc 4500 

ggatgtcg?? cggLagctg gagctttgga tcattttttt ttttccctcc agaagccctc 4560 

??ttgtcccc gtcaccgggg gcaccgtaca tctgaggccg agaggacacg ^gggcctgt 4620 

cttccaagcc gatgtggccc ggccagctgg agcttcgggt cttttttttt ttttttcctc 4b«u 

cagaagcctt gtctgtcgct gtcacccggg gcgctgtact tctgaggccg agaggacgcg 4740 

atgggcccgg cttccaagcc ggtgtggctc ggccagctgg agcttcgggt cttttttttt 4800 

tttttttttt ttcctccaga aaccttgtct gtcgctgtca cccggggcgc ttgtacttct 48 60 

gatgccgaga ggacgcgatg ggcccgtctt ccaggccgat gtggcccggt "gctggagc 4920 

tttgga?ctt tttttttttt ttttcctcca gaagccctct cttgtccccg tcaccggggg 4 980 
caccttacat ctgaggccta gaggacacga tgggcccggg ttccaggccg atgtggcccg 
gtcagctgga gctttggatc tttttttttt ttttcttcca gaagccctct tgtccccgtc 
accg^tggca ctgtacatct gaggcggaga ggacattatg ggcccggctt ccaatccgat 
gtggcccggt cagctggagc tttggatctt attttttttt taattttttc "ccagaagc 
cc?cttg?cc ct|tcaccgg tggcacggta catctgaggc cgagaggaca "atgggccc 
ggcttccagg ccgatgtggc ccggtcagct ggagctttgg atcttttttt "ttttttct 
??tttcctcc agaagccctc tctgtccctg tcaccggggg ccctgtacgt ctgaggccga 
gggaaagcta tgggcgcggt tttctttcat tgacctgtcg gtcttatcag "ctccgggt 
?g?cagggtc glccagttgt tcctttgagg tccggttctt ttcgttatgg ^cattttt 

gggccacctc cccaggtatg acttccaggc gtcgttgctc gcctgtcact ttcctccctg 558 0 

JSctttt.t gcttg?gatc ttttctatct gttcctattg gacctggaga taggtactga 5640 

cacgctgtcc tttccctatt aacactaaag gacactataa agagaccctt tcgatttaag 5/uo 

gctgtt?tgc ttgtccagcc tattcttttt actggcttgg gtctgtcgcg ^gcctgaag 5760 

Stgtccccga gccacgettc ctgctttccc gggcttgctg cttgcgtgtg cttgctgtgg 5820 

gcagcttgtg acaactgggc gctgtgactt tgctgcgtgt cagacgtttt tcccgatttc 5880 

cccgaggtgt cgttgtcaca cctgtcccgg ttggaatggt ggagccagct gtggttgagg 5940 

gccaccttat ttcggctcac tttttttttt tttttttctc ttggagtccc 9J a ^ccgc 6000 

tcttttctct tcccggtctt tcttccacat gcctcccgag tgcatttctt tttgtttttt 6060 

ttcttttttt tttttttttt ttggggaggt ggagagtccc gagtacttca ctcctgtctg 6120 

tggtgtccaa gtgttcatgc cacgtgcctc ccgagtgcac ttttttttgt ^cagtcgct 6180 

cgttgtgttc tcttgttctg tgtctgcccg tatcagtaac tgtcttgccc cgcgtgtaag 6240 

acattcltat ctcgcttgtt tctcccgatt gcgcgtcgtt gctcactctt agatcgatgt 6300 

ggtgctccgg agttctcttc gggccagggc caagccgcgc caggcgaggg acggacattc 6360 

atggcgaatg gcggccgctc ttctcgttct gccagcgggc cctcgtctct ccaccccatc 6420 
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cgtctgccgg 
cgcacttttc 
tcacgtgttt 
ccggtggcgt 
gaggtgctcc 
gcgctcccca 
tgtctgagaa 
tcgtcgggtg 
ggtcgcggct 
gagaggcctg 
aatgcccttg 
ttggtcttct 
gtcggggttt 
ggaaagggtg 
ctcgccccct 
ggcctccccg 
ccgttgctgc 
gcacaccccc 
tgggtaggcg 
tccgtcgcgt 
ctgcgccgcg 
ccccccttcc 
cctcggggtc 
gttctgtggg 
gccgctcggg 
cggtgtcgcc 
ggtgtggtgg 
tgccctgacc 
gaggggcccg 
ccccctcccc 
acccgtggcc 
cggtcaccgg 
gagctgtggt 
gagagggctg 
agtggtcatt 
ccggccctgt 
accctggcgg 
gatgtctacc 
cctcgttcct 
catctctcgc 
tcgccggggg 
ctcgccggct 
gacgttgcgc 
gagcccctgc 
tgtgtcgcgt 
gacgggtggc 
tcgttggtgt 
tcgccggtgt 
cggcccggtg 
gggacggagg 
gttggctttg 
tccggccgca 
cctcccgcga 
cctggtcctg 
ggtagcatat 
agtgaaactg 
ctacttggat 
tcccgggggg 
ctccggccgg 
acgccccccg 



tggtgtgtgg 
tcagtggttc 
cactttggtc 
tgcataccct 
tggagcgttc 
ttccctggtg 
gcccgtgaga 
aggcgcccac 
ggggttggaa 
gctttcgggg 
gaagagaacc 
ggtttccctg 
tgggtccgtc 
cgggcttctt 
gaccgcctcc 
ctccgagttc 
ggagcatgtg 
gcgtgcgcgt 
acggtgggct 
gcgtccctct 
cgtggtgcgt 
cgcggcagcg 
gagagggtcc 
agaacggctg 
ggtcttcgtc 
tcctcgggct 
gactgctcag 
ggtccgacgc 
tttcggccgc 
gctcgccgca 
gtgctgtcgg 
ggtcttgggg 
ttggagggcg 
cgtgcgaggg 
gtcccgacgg 
cgtccgtcgg 
tgggattaac 
tccctctccc 
ccctctcgcg 
gcaatggcgc 
ctggccgctg 
tcgcggactc 
ctcgctgctg 
cgcacccgcc 
cgggagcgtg 
ctatccaggg 
ggggagtgaa 
cgcgcttctc 
cggtcgacgt 
ggagagcggg 
ccgcgtgcgt 
tgcactctcc 
ggctctccgc 
tcccaccccc 
gcttgtctca 
cgaatggctc 
aactgtggta 
ggatgcgtgc 
gggtcgggcg 
tggcggcgac 



aaggcagggg 
gcgtggtcct 
gtgtctcgct 
tcccgtctgg 
caggtttgtc 
tgcctccggt 
ggggggtcga 
cccgcgacta 
agtttctcga 
gggaccggtt 
ttcctgttgc 
tgtgctcgtc 
ccgccctcag 
acggtctcga 
cgcgcgcgca 
ggggagggat 
gctcggcttg 
actttcctcc 
cccgggtccc 
cgctcgcgtc 
gctgtgtgct 
ttcccacggc 
gtgtctggcg 
ttggccgcgt 
ggtaggcatc 
cccggggggc 
gggagtggtg 
ccgagcggtc 
ccttgccgtc 
gccggtcttt 
accccccgca 
gggggccgag 
tcccggcccc 
gaaaaggttg 
tgtggtggtc 
gaaggcgcgt 
cccgcgcgcg 
cgaggtctca 
gggttcaagt 
cgcccgagtt 
tccggtctct 
ctggcttcgc 
tgtgcttggg 
ggtgtgcggt 
tccgcctcgc 
ctcgcccccg 
tggtgctacc 
tttccgccaa 
tccggctctc 
taagagaggt 
gtgctcgcgg 
cgttccgcgc 
cgccgccgcc 
gacgctccgc 
aagattaagc 
attaaatcag 
attctagagc 
atttatcaga 
ccggcggctt 
gacccattcg 



tgcggctctc 
tgtggatgtg 
tgaccatgtt 
tgtgtgcacg 
tcctaggtgc 
gctccgtctg 
ggagagaagg 
gtacgcctgt 
gagactcatt 
gcagggtctc 
cgcagacccc 
gcatgcatcc 
tgagaaagtt 
ggggtctctc 
gcgtttgctc 
cacgcggggc 
tgtggttggt 
cctcctgagg 
cacccgtctt 
cacgactttg 
tctcgggctg 
tggcgaaatc 
ttgattgatc 
ccggcgcgac 
ggtgtgtcgg 
cgtcgtgttt 
cagtgtgatt 
tctcggtccc 
gtcgccggcc 
tttcctctct 
tgggggcggc 
gggtaagaaa 
gcggccgtgg 
ccccgcgagg 
tgttggccga 
gttggqgcct 
tgtcccggtg 
ggccttctcc 
cgctcgtcga 
cacggtgggt 
cctgcccgac 
ccggagggtc 
gggggcccgc 
ttcgcgccgc 
ggcggctaga 
ccgacccccg 
ggtcattccc 
cccccacgcc 
ccgatgccga 
gtcggagagc 
acgggttttg 
gagcgcccgc 
tcctcctcct 
tcgcgcttcc 
catgcatgtc 
ttatggttcc 
taatacatgc 
tcaaaaccaa 
ggtgactcta 
aacgtctgcc 
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cggcccgacg 
tgaggcgccc 
cccagagtcg 
cgctgtttct 
ctgcttctga 
gctgtgtgcc 
aggggcaaga 
gcgtagggct 
gctttcccgt 
ccctgtccgc 
cccgcgcggt 
tctctcggtg 
tccttctcta 
ccgaatggtc 
tctcgtctac 
agagcctgtc 
ggctggggag 
gccgccgtgc 
cccgtgcctc 
gccgctcccg 
tgtggttgtg 
gcgggagtcc 
tcgctctcgg 
gtcggacgtg 
catcggtctc 
cgggtcggct 
cccgccggtt 
ttgtgaggac 
ctcgttctgc 
ccccccctct 
cgggcacgta 
gtcggctcgg 
cggtgtcttg 
gcaaagggaa 
ggtgcgtctg 
gccggagtgc 
tggcggtggg 
gcgcgggctc 
cctcccctcc 
tcgtcctccg 
ccccgttggc 
agggggcttc 
tgcggcctcc 
ggtcagttgg 
cgcgggtgtc 
cctgcccgtc 
tcccgcgtgg 
aacccaccac 
ggggttcggg 
tgtcccgggg 
tcggaccccg 
ccggctcacc 
ctctcgcgct 
ttacctggtt 
taagtacgca 
tttggtcgct 
cgacgggcgc 
cccggtgagc 
gataacctcg 
ctatcaactt 



ctgccccgcg 
ggttgtgccc 
gtggatgtgg 
tgtaagcgtc 
gctggtggtg 
ttcccgtttg 
ccccccttct 
ggtgctgagc 
ggggagcttt 
ggatgctcag 
cgcccgcgtg 
gccggggctc 
gctatcttcc 
ccctggaggg 
cgcggcccgc 
tgtcgtcctg 
agggctccgt 
ggacggggtg 
acccgtgcct 
cgacggcggc 
tcgcctcgcc 
tccttcccct 
ggacgggacc 
gggacccact 
tctctcgtgt 
cggcgctgca 
ttgcctcgcg 
ccccttccgg 
tgtgtcgttc 
cctctgactg 
cgcgtccggg 
cgggcgggag 
cgcggtcttg 
agaggctagc 
gggggctcgt 
cgaggtgggt 
ggctccggtc 
tcggccctcc 
tccgtccttc 
cctccgcttc 
gtggtcttct 
ccggttcccc 
gcccgcccgt 
gccctggcgt 
gccgggctcc 
ccggtggtgg 
tttgactgtc 
cctgctctcc 
atttgtgccg 
cgacgctcgg 
acggggtcgg 
cccggtttgt 
ctctgtcccg 
gatcctgcca 
cggccggtac 
cgctcctctc 
tgacccccct 
tccctcccgg 
ggccgatcgc 
tcgatggtag 
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tcgccgtgcc 
gagcctgaga 
cgacccgggg 
ggaatgagtc 
cagccgcggt 
tagttggatc 
ccccttgcct 
gtttactttg 
aggaataatg 
taagagggac 
gcaagacgga 
tcggaggttc 
cgatgcggcg 
ggttccgggg 
ccaggagtgg 
cggacaggat 
ttcttagttg 
ctaactagtt 
gcgttcagcc 
tgcacgcgcg 
aacccgttga 
gaattcccag 
accgcccgtc 
ggtcggccca 
agtaaaagtc 
ctgtggagga 
cgcgtgcgtc 
gaaggggtgg 
tcccctctcc 
gcgtcttgcc 
ggtttttgac 
cccatccccg 
ggatgtgagt 
gtcctccccg 
ccccgggggg 
cggtcgttcg 
cccgaggcgg 
cccgacccgc 
gggttcccgt 
cacgtgtctc 
cctctctctc 
cgtgagttcg 
tgcgtcgatg 
catcgacact 
cgtcggttga 
ctcgcagggc 
gggcggttgt 
cgcgctcgcg 
gcctcgcgtc 
tgggaaccca 
gaggttggcg 
ggttgtcggg 
gtttgggtct 
ggcgccgcgc 
gtatccccgg 
cctcggtggg 
cgtggctctt 
ccgcgggacg 
gggagggaga 
ctgtgggctg 



taccatggtg 
aacggctacc 
aggtagtgac 
cactttaaat 
aattccagct 
ttgggagcgg 
ctcggcgccc 
aaaaaattag 
gaataggacc 
ggccgggggc 
ccagagcgaa 
gaagacgatc 
gcgttattcc 
ggagtatggt 
gcctgcggct 
tgacagattg 
gtggagcgat 
acgcgacccc 
acccgagatt 
ctacactgac 
accccattcg 
taagtgcggg 
gctactaccg 
cggccctggc 
gtaacaaggt 
gcggcggcgt 
ccgggtcccg 
gtggggtcgg 
ctcgtccggc 
tctttcccgt 
ccgtcccggg 
ccgcggctct 
gtcgcgtgtg 
ctcctgtccc 
gtcgccctgc 
ggcggctctc 
cggtcgtgtg 
gccgccggct 
gtcgttcccg 
gtttcgttcc 
cggggagagg 
ctcacacccg 
aagaacgcag 
tcgaacgcac 
cgatcaatcg 
caacccccca 
cggtgtggcg 
gcttcttccc 
ggcgcctccc 
ccgcgccccc 
gttgagggtg 
gtggcggtcg 
tgcgctgggg 
accctccggc 
tggcgttgcg 
cgccttcgcg 
cttcgtctcc 
ccgcggcgtc 
gggcctcgct 
tgcgtcccgg 



accacgggtg 
acatccaagg 
gaaaaataac 
cctttaacga 
ccaatagcgt 
gcgggcggtc 
cctcgatgct 
agtgttcaaa 
gcggttctat 
attcgtattg 
agcatttgcc 
agataccgtc 
catgacccgc 
tgcaaagctg 
taatttgact 
atagctcttt 
ttgtctggtt 
cgagcggtcg 
gagcaataac 
tggctcagcg 
tgatggggat 
tcataagctt 
attggatggt 
ggagcgctga 
ttccgtaggt 
ggcccgctct 
tcgcccgcgt 
tctgggtccg 
tctgacctcg 
ccggctcttc 
ggcgttcggt 
ggcttttcta 
ggctcgcccg 
gggtacctag 
cgcccccagg 
cctcagactc 
ggggggtgga 
tgcccgattt 
tgtttttccg 
tgctggccgg 
agggcggtgg 
aaataccgat 
ctagctgcga 
ttgcggcccc 
cgtcacccgc 
acccgggtcg 
cgcgcgcccg 
gctccgccgt 
ggaccgctgc 
gtggcgcccg 
tgcgtgcgcc 
acgagggccg 
gaggcggggt 
ttgtgtggag 
agggagggtt 
ccgcacgcgg 
gcttctcctt 
cgtgcgccga 
gacccgttgc 
gggttgcgtg 



acggggaatc 
aaggcagcag 
aatacaggac 
ggatccattg 
atattaaagt 
cgccgcgagg 
cttagctgag 
gcaggcccga 
tttgttggtt 
cgccgctaga 
aagaatgttt 
gtagttccga 
cgggcagctt 
aaacttaaag 
caacacggga 
ctcgattccg 
aattccgata 
gcgtccccca 
aggtctgtga 
tgtgcctacc 
cggggattgc 
gcgttgatta 
ttagtgaggc 
gaagacggtc 
gaacctgcgg 
ccccgtcttg 
gtggagcgag 
tctgggaccg 
ccaccctacc 
cgtgtctacg 
cgtcggggcg 
cgttggctgg 
tcccgatgcc 
ctgtcgcgtt 
gtcggggggc 
catgaccctc 
tgtctggagc 
ccgcgggtcg 
ctcccgaccc 
cctgaggcta 
tcgttggggg 
acgactctta 
gaattaatgt 
gggttcctcc 
tgcggtgggt 
ggccctccgt 
cgtcgcggag 
tcccgccctc 
ctcaccagtc 
ggggtgggcg 
gaggtggtgg 
gtcggtcgcc 
cgaccgctcg 
ggagagcgag 
tggcgtcccg 
ccgctagggg 
cacccgggcg 
tgcgagtcac 
gtcccggctt 
tgagtaagat 
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agggttcgat 
gcgcgcaaat 
tctttcgagg 
gagggcaagt 
tgctgcagtt 
cgagtcaccg 
tgtcccgcgg 
gccgcctgga 
ttcggaactg 
ggtgaaattc 
tcattaatca 
ccataaacga 
ccgggaaacc 
gaattgacgg 
aacctcaccc 
tgggtggtgg 
acgaacgaga 
acttcttaga 
tgcccttaga 
ctgcgccggc 
aattattccc 
agtccctgcc 
cctcggatcg 
gaacttgact 
aaggatcatt 
tgtgtgtcct 
gtgtctggag 
cctccgattt 
gcggcggcgg 
aggggcggta 
cgcgctttgc 
ggcggttgtc 
acgcttttct 
ccggcgcgga 
ggtggggccc 
ctccccccgc 
cccctcgggc 
gtcctgtcgg 
tttttttttc 
cccctcggtc 
actgtgccgt 
gcggtggatc 
gaattgcagg 

cggggctacg 

gctgcgcggc 
ctcccgaagt 
cctggtctcc 
gcccgtgcac 
tttctcggtc 
cgtccgcatc 
tcggtcccct 
tgcggtggtt 
cggggttggc 
ggcgagaacg 
cgtccgtccg 
cggtcggggc 
gtacccgctc 
ccccgggtgt 
ccctgggggg 
cctccacccc 



tccggagagg 
tacccactcc 
ccctgtaatt 
ctggtgccag 
aaaaagctcg 
cccgtccccg 
ggcccgaagc 
taccgcagct 
aggccatgat 
ttggaccggc 
agaacgaaag 
tgccgactgg 
aaagtctttg 
aagggcacca 
ggcccggaca 
tgcatggccg 
ctctggcatg 
gggacaagtg 
tgtccggggc 
aggcgcgggt 
catgaacgag 
ctttgtacac 
gccccgccgg 
atctagagga 
aaacgggaga 
cgccgggagg 
tgaggtgaga 
cccctccccc 
ctgctcgcgg 
cgtcgttacg 
tctcccggca 
gcgtgtgggg 
ggcctcgcgt 
ggtttaagga 
gtagggaagt 
tgccgccgtt 
gccgtggggg 
tgccggtcgt 
ctccccccca 
catctgttct 
cgtcagcacc 
actcggctcg 
acacattgat 
cctgtctgag 
tgggagtttg 
tcagacgtgt 
cccgcgcatc 
cccggtcctg 
ccgtgccccg 
tgctctggtc 
gcggccgcgg 
gtctgtgtgt 
gcggtcgccc 
gagagaggtg 
tccctccctc 
ccgtggcccc 
cggcgccggc 
tgcgagttcg 
gacccggcgt 
cgccgccctc 
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ccctcccgcc ggcctctcgg ggaccccctg agacggttcg ccggctcgtc ctcccgtgcc 13680 
gccgggtgcc gtctctttcc cgcccgcctc ctcgctctct tcttcccgcg gctgggcgcg 13740 
tgtcccccct ttctgaccgc gacctcagat cagacgtggc gacccgctga atttaagcat 13800 
attagtcagc ggaggaaaag aaactaacca ggattccctc agtaacggcg agtgaacagg 13860 
gaagagccca gcgccgaatc cccgocgcgc gtcgcggcgt gggaaatgtg gcgtacggaa 
gacccactcc ccggcgccgc tcgtgggggg cccaagtcct tctgatcgag gcccagcccg 
tggacggtgt gaggccggta gcggccccgg cgcgccgggc tcgggtcttc ccggagtcgg 
gttgcttggg aatgcagccc aaagcgggtg gtaaactcca tctaaggcta aataccggca 
cgagaccgat agtcaacaag taccgtaagg gaaagttgaa aagaactttg aagagagagt 
tcaagagggc gtgaaaccgt taagaggtaa acgggtgggg tccgcgcagt ccgcccggag 14220 
gattcaaccc ggcggcgcgc gtccggccgt gcccggtggt cccggcggat ctttcccgct 14280 
ccccgttcct cccgacccct ccacccgcgc gtcgttcccc tcttcctccc cgcgtccggc 
gcctccggcg gcgggcgcgg ggggtggtgt ggtggtggcg cgcgggcggg gccgggggtg 
gggtcggcgg gggaccgccc ccggccggcg accggccgcc gccgggcgca cttccaccgt 
ggcggtgcgc cgcgacoggc tccgggacgg ccgggaaggc ccggtgggga aggtggctcg 
qggggggcgg cgcgtctcag ggcgcgccga accacctcac cccgagtgtt acagccctcc 
ggccgcgctt tcgccgaatc ccggggccga ggaagccaga tacccgtcgc cgcgctctcc 
ctctcccccc gtccgcctcc cgggcgggcg tgggggtggg ggccgggccg cccctcccac 
ggcgcgaccg ctctcccacc cccctccgtc gcctctctcg gggcccggtg gggggcgggg 14760 
cggactgtcc ccagtgcgcc ccgggcgtcg tcgcgccgtc gggtcccggg g^oogtog 14820 
gtcacgcgtc tcccgacgaa gccgagcgca cggggtcggc ggcgatgtcg gctacccacc 14 880 
cgacccgtct tgaaacacgg accaaggagt ctaacgcgtg cgcgagtcag gggctcgtcc 
gaaagccgcc gtggcgcaat gaaggtgaag ggccccgccc gggggcccga ggtgggatcc 
cgaggcctct ccagtccgcc gagggcgcac caccggcccg tctcgcccgc cgcgccgggg 
agg?ggagca cgagcgtacg cgttaggacc cgaaagatgg tgaactatgc ttgggcaggg 15120 
cgaagccaga ggaaactctg gtggaggtcc gtagcggtcc tgacgtgcaa atcggtcgtc 15180 
cgacJtgggt ataggggcga aagactaato gaaccatcta gtagctggtt ccctccgaag 15240 
tttccctcag gatagctggc gctctcgctc ccgacgtacg cagttttatc cggtaaagcg 
aatgattaga ggtcttgggg ccgaaacgat ctcaacctat tctcaaactt taaatgggta 
agaagcccgg ctcgctggcg tggagccggg cgtggaatgc gagtgcctag tgggccactt 
ttggtaagca gaactggcgc tgcgggatga accgaacgcc gggttaaggc gcccgatgcc 
gacgctcatc agaccccaga aaaggtgttg gttgatatag acagcaggac ggtggccatg 15540 
gaagtcggaa tccgctaagg agtgtgtaac aactcacctg ccgaatcaac tagccctgaa 15600 
aatggatggc gctggagcgt cgggcccata cccggccgtc gccgcagtcg gaacggaacg 15660 
ggacgggagc ggccgcgggt gcgcgtctct cggggtcggg ggtgcgtggc gggggcccgt 
cccccgcctc ccctccgcgc gccgggttcg cccccgcggc gtcgggcccc gcggagccta 
cgccgcgacg agtaggaggg ccgctgcggt gagccttgaa gcctagggcg cgggcccggg 
tggaqccgcc gcaggtgcag atcttggtgg tagtagcaaa tattcaaacg agaactttga 
aqgccgaagt ggagaagggt tccatgtgaa cagcagttga acatgggtca gtcggtcctg 
agagatgggc gagtgccgtt ccgaagggac gggcgatggc ctccgttgcc ctcggccgat 
cgaaagggag tcgggttcag atccccgaat ccggagtggc ggagatgggc gccgcgaggc 
cagtgcggta acgcgaccga tcccggagaa gccggcggga ggcctcgggg agagttctct 
tttctttgtg aagggcaggg cgccctggaa tgggttcgcc ccgagagagg ggoccgtgcc 
ttggaaagcg tcgcggttcc ggcggcgtcc ggtgagctot cgctggccct tgaaaatccg 
ggggagaggg tgtaaatctc gcgccgggcc gtacccatat ocgcagcagg tctccaaggt 
aaacagcctc tggcatgttg gaacaatgta ggtaagggaa gtcggcaagc cggatccgta 
acttcaggat aaggattggc tctaagggct gggtcggtcg ggctggggcg cgaagcgggg 
ctgggcgcgc gccgcggctg gacgaggcgc cgccgccctc tcccacgtcc ggggagaccc 
cccgtccttt ccgcccgggc ccgccctccc ctcttccccg cggggccccg tcgtcccccg 
cgtcgtcgcc acctctcttc ccccctcctt cttcccgtcg gggggcgggt cgggggtcgg 
cgcgcggcgc gggctccggg gcggcgggtc caaccccgcg ggggttccgg agcgggagga 
accagcggtc cccggtgggg cggggggccc ggacactcgg ggggccggcg gcggcggcga 
ctctggacgc gagccgggcc cttcccgtgg atcgcctcag ctgcggcggg cgtcgcggcc 
gctcccgggg agccoggcgg gtgccggcgc gggtcccctc cocgcggggc ctcgctccac 
ccccccatcg cctctcccga ggtgcgtggc gggggcgggc gggcgtgtcc cgcgcgtgtg 
gggggaacct ccgcgtcggt gttcccccgc cgggtccgcc ccccgggccg cggttttccg 
cgcggcgccc ccgcctcggc cggcgcctag cagccgactt agaactggtg cggaccaggg 
gaatccgact gtttaattaa aacaaagcat cgcgaaggcc cgcggcgggt gttgacgcga 
tgtgatttct gcccagtgct ctgaatgtca aagtgaagaa attcaatgaa gcgcgggtaa 
acggcgggag taactatgac tctcttaagg tagccaaatg cctcgtcatc taattagtga 
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cgcgcatgaa 
gccaagggaa 
agtctggcac 
gccccgtcct 
ctactctcat 
cgcttctggc 
agtgccaggt 
taaggcgagc 
atcttgattt 
tttgggtttt 
ccaagcgttc 
gaagcagaat 
tagaccgtcg 
cctgctcagt 
gccaatgggg 
gcccaagcgg 
ccccgtccgt 
ccgccgggcg 
cggccggaaa 
ggcgctaaac 
gctccctcgc 
tttcccgtcg 
gcggtcgcct 
cgtcttctcc 
tgggtgtggg 
cttgccctcc 
ccgcggcgcg 
gttggagggg 
gccggggggg 
gcggcggcga 
gcggcgcctc 
actttgtttt 
cccccccccc 

tttttttttt 

catataggtc 
gaccagatat 
tttttttttt 
atatacttat 
tccaccgatg 
ccgcgacgcg 
tggaacctta 
tactttgtct 
ataaattatc 
tttgtgttgt 
tttgtgttgt 
gttgggttgg 
ttgtttgctg 
tacacaaaca 
tatccctttc 
tgtgtgtgtg 
tacttataat 
gcagacttct 
aataaataca 
gttgaccagt 
aatagataga 
attaaccact 
atttgaactc 
tggtctgtct 
tgcttttttt 
tctgtagacc 



tggatgaacg 
cgggcttggc 
ggtgaagaga 
cgcgtcgggg 
cgttttttca 
gccaagcgtc 
ggggagtttg 
tcagggagga 
tcagtacgaa 
aagcaggagg 
atagcgacgt 
tcaccaagcg 
tgagacaggt 
acgagaggaa 
cgaagctacc 
aacgatacgg 
cccgctcggc 
tcgggaccgg 
gggggccgcc 
cattcgtaga 
tgcgatctat 
cacgcccgct 
cggcccccgc 
tccgtctccc 
agcctcgtgc 
ggccttggcc 
gtgacgcacg 
cgggaggggt 
cgctctctcc 
cgtgcgtacg 
ttccattttt 
ttttttttcc 
ccccccggcg 
ttaaattcct 
gaccagtact 
ccgaaagtcc 
tttggtgtgc 
aggaggaggt 
atggaggtcg 
gcgggctcac 
aggtcgacca 
ttttctgaaa 
tgatctagat 
tttgttttgt 
gttgtgttgt 
gttgggttgt 
ttgttttgtg 
tgcacttttt 
cttctctctc 
tgcgtgtgtg 
aataggtcgc 
gagttcgagg 
tacatacata 
tgtcaatcct 
tggatagagt 
tttccctttt 
aggaccctgg 
gctgtttgtt 
tttcttctga 
agcctggcct 



agattcccac 
ggaatcagcg 
catgagaggt 
tcggggcacg 
ctgacccggt 
cgtcccgcgc 
actggggcgg 
cagaaacctc 
tacagaccgt 
tgtcagaaaa 
cgctttttga 
ttggattgtt 
tagttttacc 
ccgcaggttc 
atctgtggga 
cagcgccgaa 
ggggtccccg 
ggtccggtgc 
ctctcgcccg 
cgacctgctt 
tgaaagtcag 
cgctcgcacg 
gcggttgccc 
gaggacggtt 
cgtcgcgacc 
aagccggagg 
gtgggatccc 
ttttcccgtg 
gcccgagcat 
aggggaggat 
tcccccccaa 
cccgatgctg 
cggagcggcg 
ggaaccttta 
ccgggtggta 
tctctttccc 
ctctttttga 
cgaccagtac 
accagatgtc 
tctggactct 
gttgtccgtc 
atcgcagagg 
ttgtttttct 
tttgttttgt 
gttgtgttgg 
gttgtttggt 
ttttgcgggt 
ttaaaataaa 
ttttttaaaa 
tgtgtgtgtg 
cgggtggtgg 
ccagcctggt 
catacataca 
ttagaatttt 
gatacaaata 
taggtttttt 
caggtcaact 
tgtttgcttg 
gacagtattt 
caatcgaact 



tgtccctacc 
gggaaagaag 
gtagaataag 
ccggcctcgc 
gaggcggggg 
gtgcgggcgg 
tacacctgtc 
ccgtggagca 
gaaagcgggg 
gttaccacag 
tccttcgatg 
cacccactaa 
ctactgatga 
agacatttgg 
ttatgactga 
ggagcctcgg 
cgtcgccccg 
ggagagccgt 
tcacgttgaa 
ctgggtcggg 
ccctcgacac 
cgaccgtgtc 
gaacgaccgt 
cgtttctctt 
gcggcctgcc 
gcggaggagg 
catcctcggc 
aacgccgcgt 
ccccactccc 
gtcgcggtgt 
cttcggaggt 
gaggtcgacc 
gggccactct 
ggtcgaccag 
ctttgtcttt 
tttactcttc 
cttatataca 
tccgggcgac 
cgaaagtgtc 
tttttttttt 
tttcactcat 
tcgaccagat 
gtttttcagt 
tttgttttgt 
gttgggttgg 
tttgtgttgt 
cgaacagttg 
tttttaaaat 
attttctttg 
cgtgcagcgt 
tagcttcccg 
ctacagagga 
tacatacata 
gtttttaatt 
taggtttttt 
tttttttccc 
ggaaaacgtg 
cttgcttgct 
ctctgtgtaa 
cagaaatcct 



tactatccag 
accctgttga 
tgggaggccc 
gggccgccgg 
ggcgagcccc 
gcgcgacccg 
aaacggtaac 
gaagggcaaa 
cctcacgatc 
ggataactgg 
tcggctcttc 
tagggaacgt 
tgtgttgttg 
tgtatgtgct 
acgcctctaa 
ttggccccgg 
cggcggcgcg 
tcgtcttggg 
cgcacgttcg 
gtttcgtacg 
aagggtttgt 
gccgcccggg 
gtggtggttg 
tccccttccg 
gtcgcctgcc 
gggatcggcg 
gcgtccgtcg 
tcggcgccag 
gcccctcctc 
ggaggcggag 
cgaccagtac 
agatgtccga 
ggactctttt 
ttgtccgtct 
ttctgaaaat 
cccacagcga 
tgtaaatagt 
actttgtttt 
ccgtcccccc 
tttttttttt 
tcatataggt 
gtcagaaagt 
tttgtgttgt 
tttgttttgt 
gttgggttgg 
ttggtgttgt 
tccctaaccg 
aaatgcgaaa 
tgtgtgtgtg 
gcgcgcgctc 
gactccagag 
accctgtctc 
catacataca 
aatgtgatag 
tttcagtaaa 
ctgtccatgt 
ttttctatat 
tgcttgcttg 
cctggtgccc 
cctgcctctt 



cgaaaccaca 
gcttgactct 
ccggcgcccg 
tgaaatacca 
gaggggctct 
ctccggggac 
gcaggtgtcc 
agctcgcttg 
cttctgacct 
cttgtggcgg 
ctatcattgt 
gagctgggtt 
ccatggtaat 
tggctgagga 
gtcagaatcc 
atagccgggt 
gggtctcccc 
aaacggggtg 
tgtggaacct 
tagcagagca 
ctctgcgggc 
cgtcacgggg 

ggggggggat 

tcgctctcct 
gccgcagccc 
gcggcggcga 
gggacggccg 
gcctctggcg 
ttcgcgcgcc 
agggtccggc 
tccgggcgac 
aagtgtcccc 
tttttttttt 
tttactcctt 
cccagaggtc 
ttctcttttt 
gtgtacgttt 
tttttttttt 
cctccccccc 
tttaaatttc 
cgaccggtgg 
ctggtggtcg 
tttgtgttgt 
tttgttttgt 
gttgggttgg 
tggttttgtt 
agtttttttg 
atcgaccaat 
tgtgtgtgtg 
gttttataaa 
gcagaggcag 
gaaaaatgaa 
tacatatgag 
agagatagat 
tatgaggttg 
ggttgctggg 
atataaatag 
cttgcttgct 
tgaaactcac 
gtctacctcc 
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caattttgga gtaaaggtgt gctacaccac tgcctggcat tattatcatt atcattatta 20880 

attttattat tagacagaac gaaatcaact agttggtcct gtttcgttaa ttcatttgaa 2094U 

attagttgga ccaattagtt ggctggtttg ggaggtttct tttgtttccg atttgggtgt 21Q0U 

ttgtggggct ggggatcagg tatctcaacg gaatgcatga aggttaaggt gagatggctc 2lUbu 

gatttttgta aagattactt ttcttagtct gaggaaaaaa taaaataata ttgggctacg ^ll^u 

tttcattgct tcatttctat ttctctttct ttctttcttt ctttcagata aggaggtcgg 21 * 8 ° 

ccagttcctc ctgccttctg gaagatgtag gcattgcatt gggaaaagca ttgtttgaga 21240 

gatgtgctag tgaaccagag agtttggatg tcaagccgta taatgtttat tacaatatag 21300 

aaaagttcta acaaagtgat ctttaacttt tttttttttt tttctccttc tacttctact 21360 

tgttctcact ctgccaccaa cgcgctttgt acattgaatg tgagctttgt tttgcttaac 21420 

agacatatat tttttctttt ggttttgctt gacatggttt ccctttctat ccgtgcaggg 21480 

ttcccagacg gccttttgag aataaaatgg gaggccagaa ccaaagtctt ttgaataaag 21b40 

caccacaact ctaacctgtt tggctgtttt ccttcccaag gcacagatct ttcccagcat 21600 

ggaaaagcat gtagcagttg taggacacac tagacgagag caccagatct cattgtgggt 21bbU 

ggttgtgaac cacccaccat gtggttgcct gggatttgaa ctcaggatct tcagaagacg 21720 

agtcagggct ctaaaccgat gagccatctc tccagccctc ctacattcct tcttaaggca 21780 

tgaatgatcc cagcatggga agacagtctg ccctctttgt ggtatatcac catatactca 21840 

ataaaataat gaaatgaatg aagtctccac gtatttattt cttcgagcta tctaaattct 21900 

ctcacagcac ctccccctcc cccacactgc ctttctccct atgtttgggt ggggctgggg 21960 

gaggggtggg gtgggggcag ggatctgcat gtcttcttgc aggtctgtga actatttgcg 22020 

atggcctggt tctctgaact gttgagcctt gtctatccag aggctgactg gctagttttc 22080 
tacctgaagt ccctgagtga tgatttccct gtgaattc 

<210> 6 

<211> 175 

<212> DNA 

<213> Mus Mus cuius 

<400> 6 ^ a. 4_ 4. 

ctcccgcgcg gcccccgtgt tcgccgttcc cgtggcgcgg acaatgcggt tgtgcgtcca 
cgtgtgcgtg tccgtgcagt gccgttgtgg agtgcctcgc tctcctcctc ctccccggca 
gcgttcccac ggttggggac caccggtgac ctcgccctct tcgggcctgg atccg 

<210> 7 
<211> 755 
<212> DNA 
<213> Mus mus cuius 

<400> 7 . 
ggtctggtgg gaattgttga cctcgctctc gggtgcggcc tttggggaac ggcggggtcg 
gtcgtgcccg gcgccggacg tgtgtcgggg cccacttccc gctcgagggt ggcggtggcg 
gcggcgttgg tagtctcccg tgttgcgtct tcccgggctc ttgggggggg tgccgtcgtt 
ttcggggccg gcgttgcttg gcttacgcag gcttggtttg ggactgcctc aggagtcgtg 
ggcggtgtga ttcccgccgg ttttgcctcg cgtctgcctg ctttgcctcg ggtttgcttg 
gttcgtgtct cgggagcggt ggtttttttt tttttcgggt cccggggaga ggggtttttc 
cgggggacgt tcccgtcgcc ccctgccgcc ggtgggtttt cgtttcgggc tgtgttcgtt 
tccccttccc cgtttcgccg tcggttctcc ccggtcggtc ggccctctcc ccggtcggtc 
gcccggccgt gctgccggac ccccccttct gggggggatg cccgggcacg cacgcgtccg 
ggcggccact gtggtccggg agctgctcgg caggcgggtg agccagttgg aggggcgtca 
tgcccccgcg ggctcccgtg gccgacgcgg cgtgttcttt gggggggcct gtgcgtgcgg 
gaaggctgcg cacgttgtcg gtccttgcga gggaaagagg cttttttttt ttagggggtc 
gtccttcgtc gtcccgtcgg cggtggatcc ggcct 

<210> 8 
<211> 463 
<212> DNA 
<213> Mus musculus 

<400> 8 

ggccgaggtg cgtctgcggg ttggggctcg tccggccccg tcgtcctccg ggaaggcgrr 

8 



60 
120 
175 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
755 



60 



WO 03/093469 



PCT/CA03/00645 



tagcgggtac 
ttctggttgt 
caggtcagcc 
agcgagcccg 
ccccggggtt 
ctggttccgg 
cgggtctccc 



cgtcgccgcg 
tggcggcggg 
tccgcctgtg 
tccgttcgac 
ttcacggcgc 
tctccccgcc 
aacccccggc 



ccgaggtggg 
ggctccggtc 
ggcttcgtcg 
cttccttccg 
cccccacgct 
aaaccccggt 
cggaagggtt 



<210> 9 

<211> 378 

<212> DNA 

<213> Mus musculus 



cgcacgtcgg 
gatgtcttcc 
gccgtctccc 
ccttcccccc 
cctccgcctc 
tgggttggtc 
cgggggttcc 



tgagataacc 
cctccccctc 
cccccctcac 
atctttccgc 
tccgcccgtg 
tccggccccg 

ggg 



ccgagcgtgt 
tccccgaggc 
gtccctcgcg 
gctccgttgg 
gtttggacgc 
gcttgctctt 



120 
180 
240 
300 
360 
420 
463 



<400> 9 

ggattcttca 

ggcggccccg 

cggcgacgac 

catggtgacc 

ggctaccaca 

tagtgacgaa 

tttaaatcct 



ggattgaaac 
ggcggtttgg 
ccattcgaac 
acgggtgacg 
tccaaggaag 
aaataacaat 
ttaagcag 



ccaaaccggt 
tgagttagat 
gtctgcccta 
gggaatcagg 
gcagcaggcg 
acaggactct 



tcagtttcct 
aacctcgggc 
tcaactttcg 
gttcgattcc 
cgcaaattac 
ttcgaggccc 



ttccggctcc 
cgatcgcacg 
atggtagtcg 
ggagagggag 
ccactcccga 
tgtaattgga 



ggccgggggg 
ccccccgtgg 
atgtgcctac 
cctgagaaac 
cccggggagg 
atgagtccac 



60 
120 
180 
240 
300 
360 
378 



<210> 10 
<211> 378 
<212> DNA 

<213> Mus musculus 



<400> 10 

gatccattgg 

tattaaagtt 

gccgcgaggc 

ttagctgagt 

aagcaggccc 

cctattttgt 

ccttattgcg 



agggcaagtc 
gctgcagtta 
gagtcaccgc 
tgtcccgcgg 
gagccgcctg 
ttggttttcg 
ccccccta 



tggtgccagc 
aaaagctcgt 
ccgtccccgc 
ggcccgaagc 
gataccgcca 
gaactgagcc 



<210> 11 

<211> 719 

<212> DNA 

<213> Mus musculus 



agccgcggta 
agttggatct 
cccttgcctc 
gtttactttg 
gctaggaaat 
catgattaag 



attccagctc 
tgggagcggg 
tcggcgcccc 
aaaaaattag 
aatggaatag 
ggaaacggcc 



caatagcgta 
cgggcggtcc 
ctcgatgctc 
agttgtttca 
gaccgcggtt 
gggggcattc 



60 
120 
180 
240 
300 
360 
378 



<400> 11 

ggatctttcc 

tcccctctcc 

gggggaccgc 

gccgcgaccg 

ggcgcgtctc 

tttcgcggaa 

ccccgtccgc 

ggtttctctc 

ggactgtcct 

tcacgccgcc 

gacccgtctt 

aaagccgccg 



cgctccccgt 
ggaggggggg 
ccccggccgg 
gctacgagac 
agggcgcgcc 
tcccggggcc 
ctcccgggcg 
tctcccggtc 
cagtgcgccc 
cccgacgaag 
gaaacacgga 
tggcgcaatg 



tcctcccggc 
gaggtggggg 
caaaaggccg 
ggctgggaag 
gaaccacctc 
gaggggaagc 
ggcgtggggg 
tcggccggtt 
cgggcgtcgt 
ccgagcgcac 
ccaaggagtc 
aaggtgaagg 



ccctccaccc 
cgcgtgggcg 
ccgccgggcg 
gcccgacggg 
accccgagtg 
ccgatacccg 
tgggggccgg 
tggggggggg 
cgcgccgtcg 
ggggtcggcg 
taacgcgtgc 
gccccgtccg 



<210> 12 

<211> 685 

<212> DNA 

<213> Mus musculus 



gcgcgtctcc 
gggtcggggg 
cacttcaacc 
gaatgtggct 
ttacagccct 
tcgccgcgct 
gccgcccctc 
agcccggttg 
ggcccggggg 
gcgatgtcgg 
gcgagtcagg 
ggggcccgag 



ccccttcttt 
tggggtcggc 
gtagcggtgc 
cggggggggc 
ccggccgcgc 
tttcccctcc 
ccacgcccgt 
ggggcggggc 
gttctctcgg 
ctacccaccc 
ggctcgcacg 
gtgggatcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
719 
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<400> 12 

cgaggcctct ccagtccgcc gagggcgcac caccggcccg tctcgcccgc cgcgtcgggg 
aggtggagca cgagcgtacg cgttaggacc cgaaagatgg tgaactatgc ctgggcaggg 
cgaagccaga ggaaactctg gtggaggtcc gtagcggtcc tgacgtgcaa atcggtcgtc 
cgacctgggt ataggggcga aagactaatc gaaccatcta gtagctggtt ccctccgaag 
tttccctcag gatagctggc gctctcgcaa ccttcggaag cagttttatc cgggtaaagg 
cggaatggat taggaggtct tggggccgga aacgatctca aactatttct caaactttaa 
atgggtaagg aagcccggct cgctggcgtg gagccgggcg tggaatgcga gtgcctagtg 
ggccactttt ggtaagcaga actggcgctg cgggatgaac cgaacgccgg gttaaggcgc 
ccgatgccga cgctcatcag accccagaaa aggtgttggt tgatatagac agcaggacgg 
tggccatgga agtcggaatc cgctaaggag tgtgtaacaa ctcacctgcc gaatcaacta 
gccctgaaaa tggatggcgc tggagcgtcg ggcccatacc cggccgtcgc cggcagtcgg 
aacgggacgg gacgggagcg gccgc 

<210> 13 
<211> 5162 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Chimeric bacterial plasmid 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gcttggtacc 
gagctcggat cgatatctgc ggccgcgtcg acggaattca gtggatccac tagtaacggc 
cgccagtgtg ctggaattaa ttcgctgtct gcgagggcca gctgttgggg tgagtactcc 
cLtcaaaag cgggcatgac ttctgcgcta agattgtcag tttccaaaaa ^gaggaggat 
ttgatattca cctggcccgc ggtgatgcct ttgagggtgg ccgcgtccat ctggtcagaa 
aagacaatct ttttgttgtc aagcttgagg tgtggcaggc ttgagatctg gccatacact 
tgagtgacaa tgacatccac tttgcctttc tctccacagg tgtccactcc caggtccaac 
tgcaggtcga gcatgcatct agggcggcca attccgcccc tctccctccc ccccccctaa 
cgttaitggc cgaagccgct tggaataagg ccggtgtgcg tttgtctata Jgtgattttc 
caccatattg ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg Jcttcttgac 
gagcattcct aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt 
gaaggaagca gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg 
caggcagcgg aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata 
agatacacct gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga 
aagagtcaaa tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt 
accccattgt atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc 
gaggttaaaa aaacgtctag gccccccgaa ccacggggac gtggttttcc tttgaaaaac 
acgatgataa gcttgccaca acccgggatc caccggtcgc caccatggtg agcaagggcg 
aggagctgtt caccggggtg gtgcccatcc tggtcgagct ggacggcgac gtaaacggcc 
acaagttcag cgtgtccggc gagggcgagg gcgatgccac ctacggcaag ctgaccctga 
agttcatctg caccaccggc aagctgcccg tgccctggcc caccctcgtg accaccctga 
cctacggcgt gcagtgcttc agccgctacc ccgaccacat gaagcagcac gacttcttca 
agtccgccat gcccgaaggc tacgtccagg agcgcaccat cttcttcaag gacgacggca 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
685 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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2280 
2340 
2400 
2460 
2520 
2580 
2640 



2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 



3300 
3360 
3420 
3480 
3540 
3600 



actacaagac ccgcgccgag gtgaagttcg agggcgacac cctggtgaac cgcatcgagc 
tgaagggcat cgacttcaag gaggacggca acatcctggg gcacaagctg gagtacaact 
acaacagcca caacgtctat atcatggccg acaagcagaa gaacggcatc aaggtgaact 
tcaagatccg ccacaacatc gaggacggca gcgtgcagct cgccgaccac taccagcaga 
acacccccat cggcgacggc cccgtgctgc tgcccgacaa ccactacctg agcacccagt 
ccgccctgag caaagacccc aacgagaagc gcgatcacat ggtcctgctg gagttcgtga 
ccgccgccgg gatcactctc ggcatggacg agctgtacaa gtaaagcggc cctagagctc 
gctgatcagc ctcgactgtg cctctagttg ccagccatct gttgtttgcc cctcccccgt 2700 
gccttccttg accctggaag gtgccactcc cactgtcctt tcctaataaa atgaggaaat ?7 60 
tgcatcgcat tgtctgagta ggtgtcattc tattctgggg ggtggggtgg ggcaggacag 
caagggggag gattgggaag acaatagcag gcatgctggg gatgcggtgg gctctatggc 
ttctgaggcg gaaagaacca gctggggctc gagtgcattc tagttgtggt ttgtccaaac 
tcatcaatgt atcttatcat gtctgtatac cgtcgacctc tagctagagc ttggcgtaat 
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 
gagccggaag cataaagtgt aaagcctggg gtgcctaatg agtgagctaa ctcacattaa 
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 
gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc ^ttcctcgc 3240 
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 3300 
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 
aatgctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 3660 
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 3720 
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 
ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 
ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 
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